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ABSTRACT 


Let  X^ ,  X£ ,  . . .  be  independent  random  variables  with  common 
distribution  function  F,  zero  mean,  unit  variance,  and  finite  moment 
generating  function,  and  with  partial  sums  Sn>  According  to  the 
strong  law  of  large  numbers, 

S 

p  =  P{ —  >  c  for  some  n  i  m} 
m  n  n 

decreases  to  0  as  m  increases  to  00  when  c  r  c  >  0.  For 

n 

general  c^  ' s  the  Hewitt-Savage  zero-one  law  implies  that  either 

Pn  *  1  for  every  m  or  else  p^  +  0  as  m  t  00.  Assuming 

the  latter  case,  we  consider  here  the  problem  of  determining  p  up 

ID 

to  asymptotic  equivalence. 

For  constant  cr  's  the  problem  was  solved  by  Siegmund  (1975); 

in  his  case  the  rate  of  decrease  depends  heavily  on  F.  In  contrast, 

-2/5 

Strassen's  (1965)  solution  for  smoothly  varying  c  *  o(n  ) 

n 

is  independent  of  F. 

We  complete  the  solution  to  the  convergence  rate  problem  by 
considering  c^  '3  intermediate  to  those  of  Siegmund  and  Strassen. 

The  rate  in  this  case  depends  on  an  ever  increasing  number  of  terms  in 

the  Cramer  series  for  F  the  more  slowly  c^  converges  to 

zero. 


v 


CHAPTER  I 


INTRODUCTION  AND  SUMMARY 

1.1.  Basic  assumptions  and  notation. 

Throughout  this  work  we  suppose  that  X^,  X2,  ...  Is  a 
sequence  of  Independent  random  variables  with  common  distribution 
function  F.  Denote  the  random  walk  of  partial  sums  by  S: 

Sn  '  V10' 

The  distribution  F  is  assumed  to  be  standardized  in  the  sense  that 
EX  -  0,  Var  X  -  1 

where,  to  facilitate  notation,  we  have  introduced  another  random 
variable  X  distributed  according  to  F.  Assume  throughout  that  the 
moment  generating  function  (mgf)  Eexp(jX)  for  F  is  finite  for 
$  in  some  neighborhood  of  0  and  write 

(1.1.1)  K(5)  -  log(Ee^) 

for  the  cumulant  generating  function  (cgf).  This  assumption,  which 
restricts  attention  to  the  so-called  mgf  case,  is  stronger  than  required 
for  the  more  elementary  results  (for  example,  the  laws  of  large  numbers) 
discussed  in  this  chapter.  However,  the  loss  in  economy  of  assumption 
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2 


is  outweighed  by  the  accompanying  gain  in  ease  of  exposition. 
Furthermore,  the  main  result  (Theorem  2.1.1)  of  this  work  deals  only 
with  the  mgf  case. 

Our  main  goal  will  be  to  estimate  the  probability  of  the  event 

sn 

{ —  >  c  for  some  n  i  m} 
n  n 

when  m  is  large  for  a  specified  sequence  c  ■  (cn)  of  positive 

numbers.  It  is  natural  to  think  of  the  sequence  c  as  a  "boundary"  on 

the  growth  of  the  sequence  (S^/n)  of  sample  means  as  the  "time"  n 

increases.  Often  it  will  be  more  convenient  to  deal  either  with  the 

1/2 

standardized  process  (Sn/n  )  or  with  the  random  walk  S.  The 
corresponding  boundaries  will  be  denoted  as  follows: 


Process 

Value  at  time  n 

Boundary 

Value  at  time  n 

sample  means 

S 

n 

n 

c 

cn 

standardized 

sn 

Jn 

f 

?(n)  “  Vhcn 

random  walk 

S 

n 

g 

g(n)  -  .®f(n) 

3 


We  write  Z  for  a  standard  normal  random  variable. 

B  -  (B(t))ti0  denotes  Brownian  motion. 

We  denote  by  |_xj  the  Integer  part,  or  "floor",  of  x,  namely, 
the  largest  integer  not  exceeding  x.  Similarly,  |~x~|  ,  the 
"ceiling"  of  x,  is  the  smallest  Integer  at  least  as  large  as  x. 

If  k  i  1  is  an  integer,  denotes  k  iterations  of  the 

natural  logarithm  function  L  5  log. 

As  usual,  the  relation  a(t)  =  b(t)  means  that  a(t)  "  0(b(t)) 
and  b(t)  ■  0(a(t)). 

1.2.  Weak  law  of  large  numbers. 

Although  the  present  work  concerns  itself  with  convergence  rates 
related  to  the  strong  law  of  large  numbers,  we  begin  with  an  examination 
of  convergence  rates  related  to  the  weak  law.  There  are  two  reasons  for 
this  review:  the  weak-law  results  (1)  provide  motivation  for,  and  (2) 
are  used  in  the  proofs  of,  the  corresponding  strong- law  theorems. 

The  weak  law  of  large  numbers  (WLLN)  states  that  S  /m  ->  0 

m 

in  probability  as  m  co  ,  i.e. ,  that  for  any  constant  c  >  0 

S 

(1.2.1)  P{  1—1  >  c}  0 

in 

as  m  od  .  Treating  upper  and  lower  tails  separately,  we  can  write 
(1.2.1)  in  the  equivalent  form 
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P{^  >  c}  -»  0, 

(1.2.1a)  S 

P{-  —  >  c}  0, 

m 

since 

S  S  S 

(1.2.2)  P{  1—5-1  >  c}  -  P{—  >  c)  +  P{-  —  >  c}. 

mm  m 

The  convergence  rate  problem  for  the  WLLN  is  to  determine  the  left  side 
of  (1.2.1)  up  *o  a  factor  (1  +  o(l)).  In  light  of  (1.2.2)  this  can  be 
accomplished  by  estimating  in  the  same  way  the  single-tail  probabilities 
in  (1.2.1a).  Since  the  random  walk  (-S)  satisfies  the  assumptions  of 
Section  1.1  for  S  with  F  replaced  by  P{-X  £  *},  it  is  enough  to 
deal  with  the  upper  tail  probability  PtS^/m  >  c} . 

A  more  general  problem  is  to  determine  the  asymptotic  behavior  of 

S 

(1.2.3)  P{-^>c} 

m  m 

for  an  arbitrary  sequence  of  positive  numbers  cm.  In  view  of  the 
central  limit  theorem  (CLT)  for  S,  it  is  convenient  to  express  (1.2.3) 
in  the  standardized  form 

S 

(1.2.3a)  P{-r-“  >  ?(m)} 

vm 

where  we  define 

f(m)  -  ytoc. 

m 
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Indeed,  the  CLT  states  that  when  fCm)  =  is  constant, 

S 

(1.2.4)  >  V  P{Z  >  V* 

The  CLT  is  thus  an  invariance  principle  in  the  sense  that  the  right  side 
here  is  independent  of  F. 

The  Berry-Esseen  theorem  (see  Feller,  1971,  p.  542)  bounds  the 
error  in  approximating  the  left  side  of  (1.2.4)  by  the  right,  uniformly 
in  tyg.  As  a  particular  consequence, 

S 

(1.2.5a)  P{-7  m-  >  I (m) }  — >  0 

Vtn 

if  and  only  if 
(1.2.5b)  ?(m)  co  . 

The  general  problem  of  convergence  rates  related  to  the  WLLN  is  to 
determine  (1.2.3)  up  to  a  factor  (1  +  o(l))  when  (1.2.5)  is  in  force. 

Return  to  the  case  ?(m)  5  cm1^  of  (1.2.1).  Any  unified 
theory  for  handling  this  case  must  require  c  to  be  small  in  some 
sense.  For  example,  if  S  has  symmetric  Bernoulli  components,  i.e., 
if  X  assumes  the  values  ±1  with  probability  1/2  each,  then  for 
any  c  i  1 

S 

(1.2.6)  P{-j~  >  c}  »  0  for  every  m. 


The  identity  (1.2.6)  cannot  hold  for  any  c,  on  the  other  hand,  unless 
F  has  compact  support. 

Making  precise  the  condition  that  c  be  small,  Bahadur  and  Ranga 
Rao  (1960)  solved  the  WLLN  convergence  rate  problem.  Their  solution  is 
contained  in  Section  3.4;  see  (3.4.12)  with  (X  »  0.  In  marked 
contrast  to  the  invariance  principle  (1.2.4),  the  rate  of  convergence  in 
(1.2.5a)  in  this  case  depends  heavily  on  F.  In  fact,  different  choices 
for  F  give  different  convergence  rates  for  (1.2.5a)  for  some  c  >  0. 

The  case  !(m)  -  o(m1//2)  with  ?(m)  oo  , 

intermediate  to  the  CLT  case  of  constant  I  and  the  WLLN  case 
1  /2 

?(m)  *  cm  ,  was  resolved  by  Cramer  (1938): 

(1.2.7)  P{  -t  >  9(m)}  »  (1  +  o(l))P{Z  >  ?(m)  }  *exp[$2(m)^T^-X(— t-^-)  ] . 

vtn  vm  vm 

Here 

is  a  certain  power  series,  the  so-called  Cramer  series  for  F,  which 
converges  for  $  in  a  neighborhood  of  0.  For  each  k  the 
coefficient  X^  depends  on  the  moments  of  F  of  orders  up  to  and 
including  k  +  3;  for  example, 

\)  m  p*  •  \  I5€X  S  '  S(EX  '  ' 

For  a  precise  definition  of  X,  see  (2.1.1). 
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For  the  normal  tail  probability  on  the  right  in  (1.2.7)  we  have  the 
standard  estimate 

(1.2.8)  P{Z  >  ?(m) }  -  [v5W^(m)]"1exp[-  ij?2(m)] 

(as  usual,  a  -  b  means  the  same  as  a  ■  (1  +  o(l))  b  ). 

mm  m  m 

On  a  logarithmic  scale  the  correction  in  (1.2.7)  to  the  normal 
approximation  becomes  negligible: 

S  .  - 

log  >  f  (m)  }  -  -  |T(m). 

Even  on  the  probability  scale  of  (1.2.7),  the  correction  is  unnecessary 
if  $  does  not  grow  too  rapidly: 

?(m)  »  o(m^^)  implies 
S 

(1.2.9a)  P{~rS  >  f(m)}  -  P{Z  >  ?(m)}. 

Vul 

If  ?  is  allowed  to  Increase  somewhat  more  quickly,  the  correction 
requires  only  the  constant  term  Xq  from  the  Cramfer  series: 

?(m)  -  o^1^)  implies 

S  3 

(1.2.9b)  >  f  (m) }  -  P{Z  >  ?(m)}-exp(X0^lI]. 

The  linear  coefficient  X^  enters  next: 

$(m)  -  o(m3/1°)  implies 

S  3 

p{-^  >  ?(m)}  -  P{Z  >  1l(m)}‘exp{X0^^°-')  •exp[X1--^m^] . 


(1.2.9c) 


In  general,  if  f(m)  ->  oo  and  |(m)  ■  o(m1'^ 

with  0  <  T|  C  1/3,  then  only  the  moments  of  F  of  orders  up  to  and 
including  fl/Tf!  -  1  need  be  known  to  identify  the  convergence 
rate  (1.2.7). 

The  transitions  in  form  from  the  CLT  to  Cramer's  result  and 

from  CramSr's  result  to  the  WLLN  solution  are  smooth.  When  ?  is 

nearly  constant,  as  in  (1.2.9a),  CramSr's  result  is  an  invariance 

principle  of  the  same  form  as  the  CLT.  When,  at  the  other  extreme, 

1/2 

'P(m)  grows  nearly  as  quickly  as  m  ,  the  convergence  rate 

(1.2.7)  depends  heavily  on  F.  The  Bahadur  -  Ranga  Rao  result  for 
1  /2 

$(m)  ■  cm  can  be  stated  in  the  form 

(1.2.10)  P(4^  >  ¥(m)}  -  (1  +  p)P{Z  >  T(m)}exP[?2(m)l^-X(%^-)] 

Jul  vtn 

as  m  oo  ,  where  p  depends  on  c  and  heavily  on  F  but  vanishes 
in  the  limit  as  c  0.  Thus  (1.2.7)  may  be  regarded  as  the 

limiting  form  of  (1.2.10)  when  c  0. 

1.3.  Strong  law  of  large  numbers. 

According  to  the  strong  law  of  large  numbers  (SLLN), 

Sn/n  0  with  probability  1;  equivalently,  for  any  constant 

c  >  0 

S 

(1.3.1)  P{l — |  >  c  for  some  n  i  m}  ^  0  as  mfoo. 

n 


Analogous  to  (1.2.2)  is  the  decomposition 
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P{—  >  c  for  some  n  i  m} 
n 

Sn 

+  P{ - >  c  for  some  n  >  m} 

II 

S  S 

(1.3.2)  -  P{— >  c  for  some  pirn  and - 3.  >  c  for  some  q  i  m} 

P  9 

for  Che  probability  in  (1.3.1).  In  Section  4.3  we  show  that  the  last 
term  in  (1.3.2)  is  asymptotically  negligible  when  compared  to  the  sum  of 
the  first  two  terms.  So  we  consider  the  one-sided  version 

S 

(1.3.3)  P{ —  >  c  for  some  n  )  n}  f  0  as  mfoo 

n 

of  (1.3.1). 

A  more  general  problem  is  to  determine  the  asymptotic  behavior  of 

S 

(1.3.4)  p^  5  P{-^£—  ^  V(n)  f°r  n  i  m) 

for  an  arbitrary  sequence  of  positive  numbers  9(n).  No  matter  what 
the  sequence  V , 

S 

(1.3.5)  pffl  |  p  i  P{-^r—  ^  J(n)  i»o.  as  n  ->  cd  } . 

It  follows  from  the  Hewitt-Savage  zero-one  law  (Feller,  1971,  p.  124) 
that 

(1.3.6)  p  ■  0  or  p  ■  1. 

The  case  p  -  1  is  trivial  from  the  convergence  rate  viewpoint,  for 

then  p  “  1  for  every  m.  The  classification  of  boundaries  7 
Qt 
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according  to  the  dichotomy  (1.3.6)  is  effected  by  the  Kolmogorov  - 
Petrovski  -  ErdSs  -  Feller  integral  test  (cf.  Jain,  Jogdeo,  and 
Stout,  1975): 


KPEF  INTEGRAL  TEST.  If  0  <  f  t,  then 

Jcd|(j 


(1.3.7)  according  as 


CD?(t)  2 
-  e 


-  -k2(t)  < 


dt  oo  .  Q 


Note  that  the  criterion  (1.3.7)  is,  like  its  weak-law  analogue 
(1.2.5),  an  invariance  principle. 

There  are  counterparts  to  (1.3. 4-7)  for  Brownian  motion:  If 

p  £  P{B(t)  >  g(t)  for  some  t  i  s} 

8 

then 

pg  i  p  £  P{B(t)  >  g(t)  i.o.  as  t  oo  } 
and  (1.3.6)  and  the  KPEF  test  hold;  here 
(1.3.8)  g(t)  -  ^f(t). 

In  fact,  (1.3.7)  for  S  is  most  easily  proved  from  (1.3.7)  for  B  by 
showing  that  S  can  be  closely  approximated  by  B  (cf.  (3.1.6)). 

In  the  interesting  case  that  p  ■  0  in  (1.3.5)  we  say  that  g  is 
an  upper  class  boundary  for  the  random  walk  S  and  write  g  6  U(S). 
(Otherwise  g  is  a  lower  class  boundary  and  g  6  L(S).)  Define 
U(B)  and  L(B)  analogously.  If  If  f,  the  KPEF  tests  allow  us 
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to  write  g  6  U  indifferently  for  g  6  U(S)  (for  any  S  )  or 
g  €  U(B).  A  similar  comment  applies  to  the  notation  g  6  L. 

The  test  (1.3.7)  gives  rise  to  the  celebrated  law  of  the  Iterated 
logarithm  (LIL),  which  quite  precisely  describes  the  ‘’interface’’ 
between  U  and  L. 

LAW  OF  THE  ITERATED  LOGARITHM.  If 

(1.3.9)  g(t)  -  [2t(L2t  +  |L3t  +  L4t  +  •••  +  L?wlt  +  (1  +  p)L?t)]1/2 
with  (>  >  3,  then 

U  > 

(1.3.10)  g  e  according  as  p  0.  Q 

L  < 

The  general  problem  of  convergence  rates  related  to  the  SLLN  is  to 

determine  p  up  to  a  factor  (1  +  o(l))  when  g  6  U.  Let 
m 

T^  -  inf  {n:  n  i  m,  Sn  >  g(n)}»  the  inf  of  the  empty  set  being 

+oo .  Then  T  i  m,  and 
m 

p  -  P{S  >  g(n)  for  some  n  i  m}  »  P{T  <  co  } 
m  n  m 

admits  the  decomposition 

p  -  P{T  ■  m}  +  P{m  <  T  <  oo  } 
m  m  m 

(1.3.11)  -  P{Sm  >  g(m) }  +  P{m  <  Tm  <  oo}. 

The  convergence  rate  for  the  first  term  is  known  from  studying  the  WLLN; 
the  second  is  new.  For  a  simple  lower  bound  we  have 
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(1.3.12)  p  }  P{S  >  g(m) } . 
m  m 

Slegntund  (1975)  used  Che  relation  (1.3.11),  together  with  the 

Bahadur  -  Ranga  Rao  estimate  for  the  first  term  and  his  own  analysis  of 

the  second,  to  solve  the  convergence  rate  problem  in  the  SLLN  case 

g(t)  ■  ct.  Strassen  (1965)  solved  the  problem  for  boundaries 

g  6  U  not  too  far  from  the  U  \  L  interface  (1.3.9)  —  roughly 

3/5 

speaking,  for  g(t)  ■  o(t  )  as  t  ->  ao  .  The  major  contribution 
of  this  work  is  to  complete  the  solution  to  the  convergence  rate  problem 
by  bridging  the  gap  between  Strassen's  boundaries  and  Slegmund's. 

We  set  the  stage  by  reviewing,  in  the  next  two  sections,  the 
results  of  Strassen  and  Siegmund. 

1.4.  Strassen's  result. 

We  recall  the  omnibus  restriction  to  the  mgf  case.  Modulo  a 
precise  definition  of  the  adjective  "smooth",  Strassen's  result  (1965, 
thm.  1.4)  can  be  stated  as  follows: 

THEOREM  1.4.1  (Strassen).  If  g  e  U  has  a  smooth  derivative, 

0  <  V  t,  and  g(t)  C  ^  for  some  /  >  0,  then 


REMARK  1.4.2.  (a)  Strasaen  used  an  Intricate  argument  to  show  that 

(1.4.2)  P{B(t)  >  g(t)  for  some  t  i  s}  -  J  as  s  co 

8 

and  used  this,  along  with  approximation  of  S  by  B  via  Skorohod 

embedding  (see  Breiman,  1968),  to  deduce  his  Invariance  principle 

(1.4.1).  In  light  of  (1.2.9a)  one  might  expect  that  the  restriction  on 

2/3 

the  growth  of  g  could  be  eased  to  g(t)  ■  o(t  ).  This  can  In 
fact  be  done  (Theorem  3.2.1). 

(b)  Theorem  4.3  In  Strassen  (1963),  a  lemma  to  (1.4.1)  credited  by 
Strassen  to  F.  Jonas,  Is  In  error.  As  noted  by  Sawyer  (1972),  the 
Skorohod  embedding  time  for  X  may  not  have  finite  mgf  even  though  X 
does.  Theorem  3.2.1  repairs  the  proof  of  (and  yields  a  result  somewhat 
better  than)  Theorem  1.4.1  by  using  the  dyadic  quantile-transformation 
approximation  of  S  by  B  due  to  Komlos,  Major,  and 

TusnSdy  (1975;  1976)  instead  of  Skorohod  embedding. 

(c)  If  we  assume  g(t)/t'^"’  then  (compare  the  proof  of 


Lemma  2.2.1(b)) 
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which  is  the  tail  Integral  in  the  KPEF  test  (1.3.7).  Q 

EXAMPLE  1.4.3.  Define  g  6  U  at  the  U  \  L  Interface 
according  to  (1.3.9),  with  Q  >  3  and  p  >  0.  Then 

(1.4.3)  P{Sm  >  g(m) }  -  [2wSf(Lm)(L2m)2(L3m)  •••  (L?_2m) (L^m)1  +  Pf 1 
which  is  of  much  smaller  order  of  magnitude  than 

(1.4.4)  pm  -  [2jBp(L  jm/l"1- 

In  contrast  we  shall  see  for  Siegmund's  boundaries  and  for  those  of 
Theorem  1.6.1  (and  also  for  Strassen's  when  g  is  not  too  close  to 
L  )  that 

(1.4.5)  pa  =  P{Sm  >  g(m)}. 

The  extreme  reluctance  with  which  (1.4.4)  tends  to  zero  is  a 
well-known  phenomenon  connected  with  the  LIL.  Were  g  only  slightly 
smaller  we  would  have  g  6  L  and  hence  p^  ■  1  for  every 
m.  Q 

1.5.  Siegmund's  result. 

In  stating  Siegmund's  solution  to  the  SLLN  case  g(t)  *  ct  we 
assume  that  c  >  0  is  sufficiently  small.  The  criterion  of  smallness, 
detailed  in  Section  3.4,  is  the  same  as  for  the  Bahadur  -  Ranga  Rao  WLLN 
result.  We  further  assume  that  if  F  is  a  lattice  distribution  with 
span  h,  then  c  is  a  point  in  that  lattice. 
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THEOREM  1.5.1  (Siegmund).  If  g(t)  ■  ct  with  c  >  0  as  above, 
then  there  is  a  constant  )  >  0  for  which 

(1.5.1)  pm  -  <1  +  y)P{Sm  >  g(m)}.  Q 

REMARK  1.5.2.  (a)  Siegmund  determined  the  constant  ) 

explicitly  and  remarked  that 

(1.5.2)  y  1  as  c  -»  0. 

Nevertheless,  for  fixed  c  the  constant  )t  like  the  constant  p 
and  the  series  \  in  (1.2.10),  depends  heavily  on  the  component 
distribution  F.  So  Siegmund's  result,  unlike  Strassen's,  is  far  from 
an  invariance  principle. 

(b)  Siegmund  utilized  the  decomposition  (1.3.11).  In  analyzing  the 
second  term  he  used  the  fundamental  identity  of  sequential  analysis,  the 
large-deviation  result  of  Bahadur  and  Ranga  Rao,  and  some 
renewal-theoretic  calculations.  The  same  kind  of  approach  is  used  in 
proving  Lemma  2.4.1  to  Theorem  2.1.1. 

(c)  For  a  generalization  of  Siegmund's  theorem  to  linear 
boundaries  g  with  nonzero  intercept,  see  Theorem  3.4.1.  Q 


16 


1.6.  Completion  of  the  solution  to  the  convergence  rate  problem. 

The  solution  to  the  convergence  rate  problem  in  the  mgf  case  is 
completed  by  the  following  theorem  (cf.  Theorem  2.1.1),  which  overlaps 
somewhat  with  Strassen's: 


THEOREM  1.6.1.  If  g  has  a  smooth  derivative  and  satisfies  the 
mono tonic  growth  conditions 

for  some  S  >  0  and 

(1.6.2)  i  0, 


then  g  e  U  and 


(1.6.3) 


p  -  i  =  jL>> 

m  ®  jnV'(m) 


P{Sm  >  g(m) } . 


REMARK  1.6.2.  (a)  That  g  belongs  to  U  is  an  easy  consequence 

of  the  KPEF  test. 

(b)  The  theorem's  method  of  proof  requires  that  g  be  kept  away 
from  the  U  \  L  interface  and  from  linearity;  hence  the  growth 
conditions  (1.6.1)  and  (1.6.2). 

(c)  The  rate  of  convergence  of  the  factor  P{Sm  >  g(m)}  to  zero 
is  given  by  Cramer's  theorem  (1.2.7).  Thus  in  the  present  case  the 
rate  of  convergence  depends  on  F,  but  only  through  a  (typically)  finite 
number  of  cumulants  of  F. 
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(d)  One  might  expect  that  a  result  like  (1.4.1),  but  with  a 
Cr amir- like  correction  to  the  exponential  factor,  would  hold. 

Indeed,  we  show  in  Lemma  3.2.4  that  (1.6.3)  can  be  recast  in  the  form 

5 1:  •'  **to  — at. 

In  particular,  (1.4.1)  holds  for  g(t)  *  o(t2^). 

(e)  As  in  Siegmund's  case,  g  increases  rapidly  enough  that 
(1.4.5)  holds  (cf.  Lemma  2.2.1(k)).  Q 

EXAMPLE  1.6.3.  The  transitions  from  Theorems  1.4.1  to  1.6.1  to 
1.5.1  are  smooth.  Let  g(t)  *  t1^2  +  ^  with  0  <  8  <  1/2. 

Then 

P  -  (1  +  1/ (26) )?{S  >  g(m) } . 
m  m 

As  8  tends  to  its  lower  limit  0,  the  factor  (1  +  1/(28))  tends 
to  ao.  This  is  consistent  with  Example  1.4.3.  As  8  tends  to  its 
upper  limit  1/2,  1/(28)  tends  to  1,  which  is  consistent  with 

(1.5.2)  in  Remark  1.5.2(a).  Furthermore,  we  have  seen  in  Section  1.2 
that  the  form  of  P{Sm  >  g(m)}  varies  smoothly  from  the  normal 
approximation  to  Cramir's  theorem  to  the  Bahadur  -  Ranga  Rao 
result.  0 
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1.7.  A  sketch  of  the  proof. 

In  this  section  we  present  an  outline  of  the  proof  of  Theorem 
1.6.1.  For  a  precise  statement  and  proof  the  reader  Is  referred  to 
Chapter  2. 

We  shall  use  the  standard  terminology  "  S  crosses  the  boundary  g 
at  time  m  "  to  describe  the  event  {Sm  >  g(m)},  although  "  S  Is 
above  g  at  time  m  "  Is  perhaps  better.  Notice  that 

{Sn-!  >  gU  “I).  Sm  >  g(m) ,  >  g(m  +1)} 

is  included  in  this  event,  even  though  S  does  not  "cross”  from  one 
side  of  g  to  the  other. 

The  first  step  in  the  proof  is  to  restrict  to  a  finite  interval 

[m,  v  )  those  times  at  which  S  crosses  g  in  the  event 
m 

{Sn  >  g(n)  for  some  n  i  m}  (whose  probability  is  Pm  )•  This  is 

done  by  choosing  v^  so  as  to  satisfy  two  opposing  constraints: 

(1)  that  the  probability  of  a  g  -crossing  after  time  vm  be 

negligible;  and  (2)  that  g  be  virtually  linear  over  [m,  v  ).  The 

m 

"smooth  derivative"  condition  guarantees  that  g'  changes  slowly  enough 

to  admit  such  a  v  . 

m 

The  first  criterion  is  made  precise  by  using  elementary 
subadditivity  considerations  and  (1.3.12)  to  show  that 


(1.7.1) 
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where 


p„  „  =  P{S  >  g(n)  for  some  m  C  n  <  v }. 

m,  v  n  m 

m 

Using  Che  mean  value  theorem,  we  Chen  crap  Che  graph  of  g  over  Che 

time-interval  [m,  v  )  between  approximating  lower  and  upper  lines 

Jf  and  jf  ,  respectively,  both  passing  through  the  point  (m,  g(m)) 
— m  m 

and  having  slope  (1  +  o(l))  g'(m)  (see  Figure  1.7.1).  Writing  Jf 

indifferently  for  jf  or  T  ,  we  define 
— m  m 

Pm  ,,  OL)  =  P{S„  >  Jf  (°)  f°r  SOme  “  ^  11  <  ▼ 

m,  v  m  n  m  m 


Clearly 


(1.7.2)  o  „  (?)  <  Pjn  „  (pm„  (f  ), 


m,v  '  m  rm,v  rm,v  '-hh' 
’  m  m  ’  m 


which  reduces  the  problem  to  that  of  showing 


(1.7.3)  p  (Jf  )  -  I 
m.v  m  qi 
m 


with  Jf  ■  Jf  or  Jf  and  I  defined  in  (1.6.3). 
m  —m  m  m 


Let 


p(jf  )  =  P{S  >  Jf  (n)  for  some  n  >  m}. 
mm  am 

correspond  to  p_  „  (?)  as  p  corresponds  to  p_ 

m.v  m  m  m.v 

9  m  9  m 

If  T  5  inf  {n:  n  i  m,  S  „>  Jf  (n)}  (as  in  Section  1.3,  with 
m  n  m 

g  replaced  by  Jf  ),  then 


m 


Fig.  1.7.1.  g  is  trapped  between  approximating  lines 
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P_(l_)  -  P{T  <  oo} 

mm  m 

“  P{S  >  £  (m) }  +  P{m  <  T  <  oo  } 
mm  m 

(1.7.4)  -  P{S  >  g(m)}  +  P{m  <  T  <®}. 

m  m 

The  first  term  can  be  estimated  using  Cramer's  theorem  (1.2.7).  For 
the  second  we  use  Siegmund's  techniques:  the  fundamental  identity  of 
sequential  analysis,  Che  appropriate  large-deviation  result  (here, 
Cramgr's  theorem),  and  some  renewal  theory.  The  result  is 

(1.7.5)  P  ■(/)  -  I. 

mm  m 

From  this  we  prove  the  analogue 

(1.7.6)  p  <f  >  - 

m 

to  (1.7.1).  Equation  (1.7.3)  then  follows  from  (1.7.6)  and  (1.7.5), 
completing  the  proof  of  the  theorem. 

The  heart  of  the  proof  lies  with  (1.7.5).  It  is  enlightening  to 
examine  the  proof  of  (1.7.5)  under  the  simplifying  assumption  that 
F  ■  J,  the  standard  normal  distribution  function. 

In  order  to  apply  the  fundamental  identity  of  sequential  analysis 
we  first  need  to  introduce  the  family  of  distributions  associated  with 
F  through  exponential  tilting.  In  the  special  case  F  ■  J 
exponential  tilting  amounts  to  nothing  more  than  a  shift  of  location. 
Accordingly,  let  PQ  denote  the  probability  under  which  , 

• •  •  are  independent  and  normally  distributed  with  mean  9  and 
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unic  variance.  If 


<(n)  =  <X  +  6  n 
is  mm 


and  T  is  redefined  by 
m 


T  i  inf{n:  n  1  m,  S  >  (X  } , 
m  n  m  * 


then 


(1.7.7)  P  (£)  -  P { S  >  g(m) }  +  P  {m  <  T  <  on  } . 
in  m  m  ^  —w  )  m 


The  idea  now  is  to  tilt  from  P  to  P£  .  There  are 

1  nr  m 


two  reasons  for  this.  First,  since  (X)  >  0,  we  have  by  the 
SLLN  the  simplification 


m 


(1.7.8)  {m  <  T  <  oo  }  -  (T  >  m}  -  {S  C(X}  a.s.  P~  . 

m  m  mm  % 

m 


Second,  letting  denote  the  restriction  of  P^  to 

the  <r  -field  generated  by  X^^ ,  x2 . Xn,  the  Radon  -  Nikodym 

derivative  of  P^g  \  with  respect  to  p£n^ 
nr  m 

assumes  the  particularly  simple  form 


dP 


(n) 


(1‘7-9)  ■  '(a)  ’  exP<“2€mSn)* 


dP, 


m 


p(-«  »>  ■  W(-«  ),T»-  »> 

in  m 


Thus 
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“  ^n>m  J {Tffl  -  n}  ®xp(“2easn>  dpSa 

a-7'10>  ‘  J(.  <  T  <  »  }  'xp<-2«.ST  >  "«  • 

m  m  m 

demonstrating  the  fundamental  identity  of  sequential  analysis. 
Recalling  (1.7.8), 

P(-C  ){m  <  Tm  <  00  }  -  J{S  <  a  }  eXp(-2CmST  >  dPC 

m  mo  mm 

(1.7.11)  -  exp(-2C  a  )  Pc  {S  (8) 

mm  m  m 

.  m 

'J  -V'  d<P«  l'|S,(«1ll. 

w  m  m 


The  second  factor  on  the  right  in  (1.7.11)  is 


P  {S  i  (X  } 
%  m  m 


m 


P{S  £  (X  -6  m} 
m  m  m 


P{~S  i  -<X  +  6  m} 
m  mm 


(1.7.12) 


-  P{S  >  -QL  +  e_m} , 
m  mm 


making  use  of  the  symmetry  and  continuity  of  J.  One  can  show  that 


jk«  m  -  (X  )  -  2*fflf'(m)  =  f(m)  -»  co  ; 
v®  m  m 

hence  (1.2.8)  implies  that  (1.7.12)  equals 

(1  +  o(l))(21»)'1/2[2m^'(m)]'1exp[-  ^nf^m  -  Q^)2) 
-  U  +  o<l))«l,(2«A),!jj|=5P(S,  >  8(a)). 


(1.7.13) 
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For  general  F,  Cramer's  theorem  (1.2.7)  would  be  needed  to  evaluate 
the  final  probability  in  (1.7.12). 

Finally,  the  integrand  in  the  third  factor  on  the  right  in  (1.7.11) 

is  the  exponential  of  a  product  of  two  factors:  C  ,  which  equals 

m 

(1  +  o(l))  g'(m)  and  so  (see  Lemma  2.2.1(b)  and  (1.6.2))  tends  to  zero 

as  m  ->  oo  ,  and  ST  -  0^,  the  amount  by  which  S  first 

m 

overshoots  the  level  (X  at  or  after  time  m.  Using  a 

in 

renewal-theoretic  result  of  Lorden  (1970)  one  can  show  that  this  excess 

is  of  smaller  order  of  magnitude  than  1/C  and  from  this  that  the 

m 

integral  on  the  right  in  (1.7.11)  tends  to  1  as  m  oo  . 

Combining  the  results  of  our  calculations, 

n-7-1*)  p»««>  ■  >  s<"»[i  +  <i  + 

which  can  be  reduced  without  difficulty  to  (1.7.5). 

1.8.  Extensions. 

Theorem  1.6.1  is  restated  and  proved  in  detail  in  Chapter  2.  In 
Chapter  3  we  give  a  correct  proof  of  Strassen's  theorem  and  a  slight 
generalization  of  Siegmund's.  Chapter  4  discusses  whether  >  can  be 
changed  to  ^  or  vice  versa  in  the  definition 

p  ■  P{S^  >  g(n)  for  some  n  i  m} 
m  n 


without  affecting  the  convergence  rate  and  considers  the  case  of  two 
boundaries.  In  Chapter  5  we  obtain  a  partial  asymptotic  expansion  for 


p  In  Strassen's  case.  The  expansion  given  by  this  invariance 

Bi 

principle  contains  a  greater  number  of  terms  the  nearer  g  is  to  the 
U  \  L  interface  and  in  fact  forms  a  complete  asymptotic  series 
when  g  leaves  the  range  of  Theorem  1.6.1.  We  also  get  asymptotic 
upper  bounds  on  the  relative  error  in  the  approximation  (1.6.3). 

We  conclude  this  summary  by  listing  without  comment  four  problems 
ripe  for  future  research. 

(1)  Develop  complete  asymptotic  expansions  for  p^  to  extend 
Theorems  1.5.1  and  1.6.1. 


(2)  Increase  the  dimensions  of  both  X  ("state")  and  m  ("time"). 

(3)  Allow  the  components  to  have  unequal  distributions. 

(4)  Treat  the  non-mgf  case.  In  other  words,  what  results  can  be 
salvaged  when  only  finiteness  of  low-order  moments  is  assumed? 


CHAPTER  II 


COMPLETION  OF  THE  SOLUTION  TO  THE  CONVERGENCE  RATE  PROBLEM 

In  this  chapter  we  more  carefully  state  and  prove  Theorem  1.6.1, 
thereby  completing  the  solution  to  the  general  problem  of  convergence 
rates  related  to  the  SLLN. 

2.1.  Statement  of  theorem. 

Let  X,  X^,  X2,  •••  be  independent  random  variables  with 
common  distribution  function  F,  zero  mean,  unit  variance,  and  moment 
generating  function  Eexp<5X)  finite  for  5  *-n  some  neighborhood 

of  0,  and  put 

Sa  -  Cl 

Let 

K(5)  -  logCEe5*) 

denote  the  cumulant  gererating  function  corresponding  to  F.  The 
so-called  Cramer  series 

K5>  •  Co  V5k 

for  F  is  defined  implicitly  for  $  near  0  by 
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(2.1.1)  53x<5)  -  K(z)  -  *5  +  ^$2, 

z  (  5  z(J)  )  given  by  K'(z)  -  J. 
Let  g:  (0,  cd  )  ->  (0,  cd  )  and  write 
g(t)  -  Jt¥(t). 

Define 

(2.1.2)  p  -  P{S  >  g(n)  for  some  n  i  m}. 

m  n 

THEOREM  2.1.1.  Suppose  that  as  t  t  cs 


for  some  0  <  S  <  1/2  and 

(2.1.4)  |  0. 

If  g  is  continuously  differentiable  and  if  for  some  0  <  r  <  1 

(2.1.5)  g'(u)  -  g'(t) 

when  t,  u  ao  with  t  (  u  (  t(l  i  l/1f2r(t)], 
then  g  6  U  and 

(2*1*6)  Pa"  ImiJ^7P{Sm>  8(m)}  as  B  *  ®  *  0 
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REMARK  2.1.2.  (a)  The  conclusion  (2.1.6)  continues  to  hold  If  the 

various  regularity  conditions  imposed  on  g  are  assumed  to  hold  only 
for  large  t. 

(b)  The  technical  interpretation  of  (2.1.5)  is  that  both  the  sup 
and  inf  of  the  sets 

(2.1.7)  :  t  *  u  £  tfl  +  l/?2r(t)J) 

tend  to  unity  as  t  ao  .  This  assumption  is  implied  by  the 
condition 

(2.1.8)  g'(u)  ~  g'(t)  as  u  ~  t  co  , 

which  we  interpret  to  mean  that  for  any  function  s  of  t,  if 
s(t)  -  t  as  t  oo  ,  then  both  the  sup  and  inf  of  the  sets 

:  u  is  between  t  and  s(t)} 

g  (t; 

tend  to  unity  as  t  -¥  ao  . 

(c)  The  rate  of  convergence  for  P{Sm  >  g(m) }  is  specified  by 
(1.2. 7-8).  Q 

2.2.  Some  facts  about  the  boundary. 

The  present  section  is  reserved  for  a  list  of  elementary  properties 
of  g  resulting  from  the  assumptions  (2. 1.3-5). 
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LEMMA  2.2.1.  Let  g:  (0,  oo  )  ->  (0,  oo  )  be  continuously 
differentiable  and  satisfy  (2. 1.3-5),  and  let  g(t)  -  t*‘/^?(t). 
Then 

(a)  t  (whence  V(t)  t  oo  )  and  'V  0; 

(b)  (j+S)^ig'(t)^; 


(c)  r(t)  -  t_1/2(g'(t)  -  1 1^1); 

(d)  6  c  ?'(t)  *  y 

<•> 

(f,  o  4  4  -  $>I^; 

(g)  0  4  4  -  S>^§2-  -  0(5^-)  as  t  -»  oo ; 

(h)  ?(u)  -  ?(t)  as  u  -  t  ->  as 

(1)  $2(u)  -  ?2(t)  ->  0  when  t,  u  ->  oo  with  t  £  u  C  t  +  1; 


by  (2.1.4).  The  first  inequality  follows  similarly  from  (2.1.3). 
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(d)  Combine  (b)  and  (c). 

(f)  Combine  (d)  and  (e). 

(g)  Use  (f)  and  (a). 

(h)  If  t  £  u  then  (a)  yields 

f(t)  £  ¥(u)  £  (^)1/2f(t)  -  (1  +  o(l))$(t); 
the  case  t  >  u  is  handled  similarly. 

(i)  By  (a),  the  mean  value  theorem,  (d),  and  (h), 

0  £  ?2(u)  -  f2(t)  £  ?2(t  +  1)  -  1p2(t)  £  f(t  +  1)  •  ^2- 
<  (1  +  oU))-!--^-  -  o(l). 

(j)  Use  (c),  (b),  and  (h)  and  (2.1.5). 

(k)  Use  (c)  and  (b).  0 

In  Sections  2.3  through  2.6  we  prove  Theorem  2.1.1  following  the 
outline  of  Section  1.7. 


Throughout  this  section  “fact  (•)"  refers  to  part  (•)  of  Lemma 


2.2.1. 

Recall  that  we  wish  to  define  v  >  m  in  such  a  way  that 

m 

(2.3.1)  pv  «  o(pm)  as  m  oo 

m 

but  g  is  virtually  linear  over  [m,  v  ).  As  it  turns  out,  an 

m 

appropriate  choice  is 

(2.3.2)  vm  -  [_m(  1  +  l/I?2r(m))J  . 

Fact  (a)  implies  that 

(2.3.3)  v  -  m 

m 

and  (2.1.5)  yields 

(2.3.4)  g'(t)  -  g'(m)  when  m,  t  oo  with  m  i.  t  £.  v  . 

m 

LEMMA  2.3.1.  If  v^  is  defined  by  (2.3.2),  then  (2.3.1)  holds. 
PROOF .  By  subadditivity 

(2.3.5)  P>  <  P(Sn  >  g(n)}; 

we'll  replace  m  by  v^  in  (2.3.5)  to  get  an  asymptotic  upper  bound 

on  pv  .  By  CramSr's  theorem, 
m 

P(Sn  >  g(n) }  -  [^n?(n)]_1exp{- -^2(n)[l  -  2^-X(^-)  ] } 
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(2.3.6)  -  [ (n) ]_1exp{-  ^2(n)}, 
where 

(2.3.7)  5(t)  =  ($2(t)[l  -  2li^-X(^-)]}1/2- 
Note 

(2.3.8)  f(t)  -  f(t)  as  t  oo  . 


Also,  as  t  ->  cd 


iidlht)  -  2»Cc)1'(t)!l  - 


-  (1  +  o(l))2$(t)?'(t) 

-  (1  +  o(l))2>0f2(t)  • 

-  (1  +  o(l))2^(t)?'(t) 

by  facts  (d)  and  (g),  so  that 

(2.3.9)  ?'(t)  -  ?'(t) 

Hence  when  t,  u  ao  with  tiuit  +  1 

2 

(2.3.10)  0  £  ?2(u)  -  f2(t)  £  (1  +  0(l ))?--(.£!  „  o(1) 

(cf.  proof  of  fact  (i)).  From  (2.3.5-10)  and  facts  (h)  and  (d)  follows 
V2ffpm  C  (1  +  o(l))  [?(n)]"’1  exp[-  yf2(n)] 
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-  (1  +  o(l)) 

*  <1  +  0(1)) 


jr  ^(t>1”1  exp["  i  ^2(t)]  dt 

s_1  F00^—  exP[-  i?2( t) ]  r(t)  dt 

J"nt) 


But 


ocfV'8  - 

J  m 


(t)  exP[-  iv2( t) ]  r(t)  dt). 


J“ 


00  51/S  -  2 


?i/0  "  ^(t)  exp[-  -pr2(t)]  f'(t)  dt 


1  2 

1/8-2  ~T  . 

e  du 


-  f1/S  ~  3(m)exp[-  £P(m)] 


so 

(2.3.11)  pm  -  0(71/S  "  3(m)exp[-  ^2(m)]). 

Now  by  the  mean  value  theorem,  (2.3.8),  (2.3.9),  facts  (J)  and  (d),  and 
(2.3.2), 


~2 

f2(v  )  -  f2(ni)  i  (1  +  o(l))2S^-^(vm  -  m) 

m  mm 


-  (1  +  o(1))28^2(1  ‘  r)(m) 


and  thus  (2.3.11)  (with  m  replaced  by  )  easily  yields 


pv  -  o([w5i»?(m)]'1exp[-  ^f2(m)]) 
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(2.3.12)  -  o(P{Sm  >  g(n)})* 

To  complete  the  proof  of  Lemma  2.3.1  use  the  obvious  bound 

(2.3.13)  pm  i  P{Sffl  >  g(m)}.  0 


Define 

(2.3.14)  p  -  P{S  >  g(n)  for  some  m  C  n  <  v  }; 

rm, v  n  °  m 

*  m 


clearly, 


(2.3.15)  p  <  p  i.  p  +  p 
*in,v  ra.v  rv 

m  m  m 


We  therefore  immediately  obtain 

COROLLARY  2.3.2.  With  p  given  by  (2.1.2)  and  p_  by 
-  r  tn  tn,  v 

m 

(2.3.14), 

(2.3.16)  p  -  p  asm— >oo.  Q 

rm  rm,v  u 

’  m 

2.4.  Linearization  of  the  boundary. 

We  define  here  straight  lines  (  and  T  .  both  passing  through 

— m  tn 

the  point  (m,  g(m)),  which  well  approximate  g  over  the  interval 
[m,  v  ]  (Fig.  1.7.1).  The  definition  of  (respectively,  J?m  ) 

together  with  the  mean  value  theorem  will  imply  that  this  line  minorizes 
(majorizes)  g  on  [m,  v  ]. 
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The  slope  C  (respectively,  €  )  of  SL  (  5,  )  is 

— m  in  “in  m 

defined  to  be  the  minimum  (maximum)  value  of  g'  over  the  interval 

tm,  v  ] .  We  now  treat  both  lines  at  once  by  writing,  for  example, 

l  indifferently  for  l  or  T  •  By  (2.3.4) 
m  — m  m 

(2.4.1)  C  -  g'(m)  as  m  ->  co  . 

m 

The  y  -intercept  of  t  is 

m 

(2.4.2)  (X  »  g(m)  -  mfi  . 

m  m 

In  analogy  with  (2.1.2)  and  (2.3.14)  define 

(2.4.3)  p  (Jt  )  ■  P{S  >  t  (n)  for  some  n  i  m} 

mm  n  m 

and 

(2.4.4)  p  (£  )  •■  P{S  >  t  (n)  for  some  m  C  n  <  v  }. 

m,  v  mam  m 

m 

The  key  to  analyzing  (2.4.3)  is  the  following  lemma,  to  be  proved  in  the 
next  section. 

LEMMA  2.4.1.  Let  denote  the  straight  line 

“  ■■  ■  ■  m 

(2.4.5)  £  (t)  -  (X  +  6t 

m  mm 

and  define  p  (I)  by  (2.4.3).  If  €  >  0  satisfies 

mm  m 

(2.4.6)  6  0 

m 


and 
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(2.4.7)  ^  cd  , 

m 


and  if 


(2.4.8) 


lim  sup 
m  ->  cd 


la 


m 


6  m 
m 


<  1, 


then 

26 

<2'4'9>  - V'P{S»>,m<"»  “  »->»•  0 

e  — 51 

m  m 

REMARK  2.4.2.  The  conditions  (2. 4. 6-7)  demand  that  C_  tend  to 
-  m 

0,  but  not  too  quickly.  Assumption  (2.4.8)  requires,  loosely  speaking, 

that  the  limiting  proportional  contribution  of  the  constant  term  to  the 

value  at  t  »  m  of  either  (  ( t)  ■  (X  +6t  or 

m  mm 

f  (t)  ■  -<X  +  €  t  (which  arose  in  (1.7.12)  for  the  example 
m  m  m  r 

F  ■  J  )  is  less  than  1/2.  It  follows  from  (2. 4. 8-9)  that 

<■,<*„> '  ?lsm  >  *„<•»•  o 

According  to  (2. 1.3-4),  (2.4. 1-2),  and  Lemma  2.2.1(b),  (  -  ( 

m  — m 

or  l  satisfies  the  assumptions  of  the  lemma.  Recall  that 
m 

(2.4.10)  fa(m)  -  g(m) . 

Furthermore,  by  (2.4.2),  (2.4.1),  and  Lemma  2.2.1(b)-(c) 

a  ,  ,  . 

c  — ^  -  2[e  -ifiisi] 

m  m  1  m  2  m  1 
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-  2((1  +  o(l))g'(a)  -  J^-] 

-  (1  +  o(l))2[g'(m)  - 

(2.4.11)  -  (1  +  o(l))2w0f'(m). 

Combining  (2.4.1)  and  (2.4.9-11)  we  get 

(2.4.12)  P  (f  )  ~  p*s„,  >  8<®)>  “  Im  as  ■  “>  ®  . 

a  m  ^lp'(m)  m  m 

completing  the  analysis  of  (2.4.3). 

In  Lemma  2.4.3  below  we'll  prove  the  analogue 

(2.4.13)  pv  (j?m)  I  P{$n  >  fm(n)  for  some  n  i  vj  -  oipjtj) 

m 

to  (2.3.1).  As  an  immediate  corollary  (cf.  Corollary  2.3.2), 

(2*4-U)  Pm.v  <*a>  -  as 

m 

We  combine  (2.4.14)  and  (2.4.12)  to  obtain  the  rate  of  convergence  for 
(2.4.4): 


(2.4.15)  p  (fn)  -  lm  as  m  oo 

in 


Moreover,  since  g(n)  is  trapped  between  ^(n)  and  fm(n)  we  have 


(2.4.16)  p  <7„>  i  P  <  P  (!«>• 

*  m  m  m 


The  main  result  (2.1.6)  follows  from  (2.3.16)  and  (2.4.15-16). 
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The  remainder  of  this  section  is  devoted  to  the  following  corollary 
to  Lemma  2.4.1. 

LEMMA  2.4.3.  With  l  -  f  or  7  .  (2.4.13)  holds. 

-  m  —in  in 

PROOF .  To  begin  the  proof  of  (2.4.13)  we  apply  Lemma  2.4.1  to  the 
left  side  to  yield 

(2.4.17)  „v  (tj  -  — P(S„  >  <„(»„)). 
m  in  m 

m  v 

m 

recalling  (2.3.3)  to  verify  the  hypotheses  (2. 4. 7-8).  In  light  of 
(2.3.3)  and  (2.4.8),  the  first  factor  on  the  right  in  (2.4.17) 
asymptotes  to  the  first  factor  on  the  right  in  (2.4.9),  or,  in  the 
present  context,  to  the  first  factor  in  I  (recall  the  proof  of 
(2.4.12)).  Moreover,  by  CramSr's  result 


P{S 


v 

m 


(2.4.18) 


-  -  2^<Vm> 


m' 


Jv, 


m 


?  (v  ) 

*<-3ra“)lh 

m 


here  we  have  written 
(2.4.19)  Jfm(t)  -  <*\(t) 
so  that  ^  (m)  ■  ?(m)  and 

CD 

f  (v  )  •  V  (v  ) 

nr  nr  m  o'*  nr 
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(2.4.20) 

Put 

(2.4.21) 

then 

(2.4.22) 

But  with 

(2.4.23) 

so  that 

(2.4.24) 


.  v  ”1/2(a  +  6 v ) 
n  a  on 

-  (1  +  o(l))n”1/2(0Cm  +  €mm) 

-  (1  +  o(l))lT(m). 


Vfc>  V°  1/2 

fm(t)  -  {Tffl2(t)[i  -  2—^ — x(  jf. — )  1  >  ; 


p{Sv  >  <nCv0)>  -  lJ5S*(.)fW 

m 


f(5)  ■  ?2U  '  25M5>1  -  52  *«  5  -*  °> 


£'(5)  -  2J|1  -  2|X(5»  -  2?2IX(5)  +  JX'«>]  -  2?  »»  S'*0’ 


?  (V  )  ?_(«) 

ffl  ni  ~  „  01 


\2<V  -  \2™  ■  V(-5^f->  '  *f(_5_) 


?_<») 


?  (m)  1 


we  have 
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So 


T(m') 

.(1  +  o(l))m  •  2TyC-y- 


l_(v  ) 

in  m  , 

A 


m 


+  u  +  .a>,^^ 


a  am 

■(1  +  0(D)  *  2wfflP(m)[(Cm  +  — )  -  («n  +  7")1 


m 


+  (1  +  o(l))If2(1  ”  r)(m) 


-(1  +  o(D)  *  2^|(m)^(l  -  ~~)  +  (1  +  o(D)?2(1  "  r)C»> 


m 


-  -(1  +  o(l)>  *  2JS91  ~  2r(m)  *  +  O  +  r)(m) 

-  -a  +  oudvEY1  *  2r(»)  *  2~ 

+  (1  +  o(D)^1  "  2r(m>  *  (*m  +  ^ 

-  (l  +  O(D)^1  '  2r(“>  •  («B  -  TT> 

.  (l  +  0<D)  •  2m?'(m)71  "  2r(m)  (recall  (2.4.11)) 
i  (1  +  o(l))  •  2SI2(1  "  r^(m)  (Lemaa  2.2.1(d)). 


Pv  (*„>  pfsvB  2  *.‘V> 

Tji g~ ' (1  +  0(l))  hsI  >  *<»>' 
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-  (1  +  o(l))  •  exp{-  jlVa2iva)-  fo2<“0]}  "  o(l), 

completing  the  proof  of  (2.4.13).  Q 
2.5.  Line-crossing  probabilities. 

This  section  is  devoted  to  the 

PROOF  of  Lemma  2.4.1.  (^first  need  to  establish  some  notation. 

Define  the  distribution  function  F  by 

m 

(2.5.1)  Fm(x)  -  F(6m  +  x); 

in  other  words,  if  X  is  distributed  according  to  F  ,  then 

IB 

X  -  C  has  distribution  function  F.  The  effect  of  F  will  be  to 
m  m 

replace  the  linear  boundary  by  a  horizontal  one:  in  obvious 

notation, 

Pm(|tm)  •  PP  {S  >  a  for  some  n  i  m). 
mm  r  n  m 
m 

Let  Km  denote  the  cumulant  generating  function  corresponding  to 

F  so  that 
m 

(2.5.2)  Ko(5)  -  K(5)  -  8m5. 

Recall  the  assumption  that  X  is  finite  in  an  open  interval  1 
containing  0;  Kffl  is  also  finite  in  this  interval. 
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Since  F  has  zero  mean  and  unit  variance,  it  is  well-known  that 

(2.5.3)  K(§)  -  ^2,  K'(5)  -  K”(§)  -»  1  as  5  0, 

and  that 

(2.5.4)  K(0)  -  0,  K'(0)  -  0,  K"(5)  >  0  for  all  J  6  I. 

From  (2.5.2)  and  (2.5.4) 

(2.5.5)  K  (0)  -  0,  K  '(0)  -  -6  ,  K  ”(§)  >  0  for  all  lei. 

m  id  id  m  ^ 

Hence  there  exists  at  most  one  nonzero  value  §^(m),  necessarily 
positive,  for  which  5^ Cm) )  »  0.  We  now  show  that  5^(m) 
exists  for  sufficiently  large  m  and  that 

(2.5.6)  (m)  ~  26  as  m  ->  oo  . 

1  m 

Indeed,  let  3m  ->  0  be  an  arbitrary  real  sequence.  By 

(2. 5. 2-3), 

(2.5. 7)  Km(5ffl)  -  (1  +  o(D)  •  -  Cm5m  as  m  -»  oo  . 

In  particular,  with  l  5  C  •  26  (C  >  0) 

^m  m 

(2.5.8)  K  (5 )  -  2C6  2[(1  +  o(l))C  -  1]. 

ID  ul  ID 

If  C  <  1,  then  K  («  )  -  -2C(1  -  C)6  2  <  0;  if  C  >  1,  then 

ID  ID  ID 

K  (*)  “  2C(C  -  1)6  2  >  0.  This  shows  that  (m)  exists 
for  sufficiently  large  m  and  that,  for  any  pair  of  constants 
0  <  C  <  1  <  C',  C  <  51(m)/(26m)  <  C'  for  large  m;  (2.5.6) 
follows. 
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Similarly,  for  large  m  there  exists  exactly  one  point 
5g(m)  e  (0,  J^C®))  at  which  K  '  vanishes,  and 

(2.5.9)  $q(ih)  "  as  m  oo  . 

This  follows  from  the  fact  that  if  5  0,  then 

(2.5.10)  Km'(5m)  -  (1  +  o(l))§m  -  €m  as  m  cd  . 

Next,  let  P  denote  the  probability  under  which  X,  X,  , 
m  i 

)  •••  1* i« d«  with 

(2.5.11)  Pm{X  0  dx}  -  exp[50(m)  •  x  -  Km(50(m))]  Fm(dx). 

P  has  cgf 
m 

•  K.<50<“>  +  °>  -  W"» 

(2.5.12)  -  M5q(.)  +  9)  -  K(50(«))  -  «m9; 
note  that 

(2.5.13)  <5m(0)  -  rfB'(0)  -  0,  -  K"(50(m)). 

Put 

(2.5.14)  90(m)  -  -50(m),  S^tn)  -  ^(m)  -  5Q(m). 


Then 
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(2.5.15)  0„(m)  “  *  9.  (m)  ”  S  aa  m  ->  oo 

u  mi  m 

and 

^n(e0(m))  -  ^(e^m))  -  >Km(50(m))  -  -(K($Q(m))  +  i^Cm)) 

(2.5.16)  -  as  m  ®  . 

We  now  introduce  the  distributions  associated  through  “exponential 

tilting"  with  that  of  X  under  P^.  For  each  real  0  for  which 

6  (0)  <  co  let  P  .  be  the  probability  under  which  X, 
at 

X^,  X^,  ...  are  i.i.d.  with 

P  „{X  edx}  -  exp[0x  -  i  (0)]  P  {X  6  dx} 
m,0  r  mm 

(2-5-17)  -  exp[0x  -  ^(0)]  exp[§0(m)  •  x  -  Km(50(m))]  Fffl(dx). 

The  corresponding  cgf  i  -  is  given  by 

‘  *m(9  +1»>  - 

-  V30(m)  +  9  +  1!>  -  V5o(m)  +  9> 

(2.5.18)  -  K(50(m)  +  9  +  T|)  -  K(JQ(m)  +  0)  -  6^. 

In  particular, 

(2.5.19)  EnjeX  -  *m>e'(0)  -  V(9)  <’  -»  or  >  0 

according  as  0  <,  ■,  or  >  0 

by  (2.5.13)  and  the  strict  convexity  of  i  . 

m 


As  special  cases  of  (2.5.17) 


(2.5.20)  -  Fm<dx)  "  F<em  +  dx> 

(cf.  (2.5.14),  (2.5.16),  and  (2.5.1)).  Hence  by  (2.4.3)  and  (2.4.5) 


(2.5.21)  p  (f  ) 

m  ID 


P  Q  ,  .{S  >  «  for  some  n  i  m}. 

m,WQ(m)  n  m 


Without  loss  of  generality  we  consider  P  to  be  the 

distribution  of  X^,  X2,  ...,  defined  on  the  space  of  (infinite) 

sequences  of  real  numbers.  Accordingly,  let  P^n^ 

denote  the  restriction  of  P  _  to  the  or  -algebra  generated  by 

m,» 

the  first  n  coordinates  (  n  ■  1,  2,  ...  ).  Then  for  any  ©'  and 
0",  P(n2,  and  P(n2„  are  mutually 

absolutely  continuous,  and  by  (2.5.17) 

dPin©' 

(2.5.22)  — -  exp{(9'  -  e")Sn  -  n[*B( 0')  -  *B(0")]}. 

dPm,9” 
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In  particular,  by  (2.5.14),  (2.5.16),  and  (2.5.22), 


dP 


(r) 
m,e0(m) 


(2-5. 23)  -~(n)  “  exp[-51(m)Sn]  (  n  -  1,  2,  ...  ), 


m,01(m) 


Let 


T  *  inf{n:  n  i  m,  S  >  (X  } , 
m  *  n  n  * 


the  inf  of  the  empty  set  being  +co  .  Then  by  (2.5.21) 


(2.5.24)  p  (jf  )  -  Pra  ,  ,{Sm  >  <X  }  +  P_  /  .{m  <  T  <  go}. 
m  m  m,0Q(m)  m  m  m,0Q(m)  m 


By  (2.4.5)  and  (2.5.20),  the  first  probability  in  (2.5.24)  is 
P{Sm  >  ^(m)}.  To  complete  the  proof  of  (2.4.9)  we  shall  use  the 
approach  of  Siegmund  (1975)  to  show  that 

(X 

6  +  — 

(2.5.25)  Pm>e  (m){m  <  Tm  <  oo}  -  — - *  P{Sm  >  *m(m)}  as  m  ->  oo 

0  g _ o 

m  m 


We  first  apply  the  fundamental  identity  of  sequential  analysis,  to 
wit:  by  (2.5.23) 


Pm  A  <  Tn,  <  00  } 

m,90(m)  m 


•  C-i  J(I  -  „)  "... 

m  i 


(m) 
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-J(»<Tn<a,)  “Pl-JiO'V  dp«.e1 


(») 


(2.5.26) 


expt^mXXJ 

'  J{.  <  Tm  <  -  VI  dP..8l(.)- 


Recalling  e^m)  >  0,  (2.5.19)  implies  Em  Q  >  0;  hence  by 

the  SLLN 


{ 


m  <  Tm  <  ® }  "  {o  <  Tm  C  00  }  "  {Sm  C  *m}  a,s*  P«,0l(«)' 


and  the  final  integral  in  (2.5.26)  equals 


(2.5.27) 


Its  can  -VI 

mm  “  * 

-P  .  ,  AS  £<X}-E  _  ,  .(exp[-§.(m)(ST  -a)]  |S  ia  ) 

m,6^(m)  m  m  m,0^(m)  J1  m  m  mm 


Pm,ei(m)^Sm  *  °m} 


J 


E  .  .(exp[-51(m)(ST  -<^)]  IS,  -  0Cm  -  y) 


[0,ao  )  ®,9^(m) 


m 


•  p  o  /•  x{s_  e  a  -  dy  I  sm  c  a  }, 

m,9^ (m)  mm  m  m 


The  last  conditional  expectation  in  (2.5.27)  is 


(2.5.28)  !„i»i(„)(^K1CKSr(,)  -  y)l>. 
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where  TCy)  ■  inf{n:  >  y}.  We  show  below  that  the  expression 

(2.5.28)  tends  uniformly  in  0  (  y  <  co  to  1  as  m  — >  oo  ;  thus 

(2.5.29)  P  {m  <  T  <  co  }  ~  exp[>5  (m)<X  ]  •  P  ,  ,{S  <  <X  }, 


Moreover,  mimicking  the  proof  of  Cramer's  theorem  we'll  show  in 
Section  2.6  that 


a 

m 

e  +  — 

m  m 


(2-5-30)  p.,e,(.)ls.  ‘  \>  '  «p[51(«X>b) - sr  ‘  p<s.  >  *.(•)»■ 

1  s  -  — 

m  m 

Then  (2.5.25)  follows  from  (2.5.29)  and  (2.5.30). 


The  first  of  our  two  assertions  is  that 
(2.5.31)  Em^Q  (m)(exp(-51(m)Ry) )  -»  1  as  m  ->  cd 
uniformly  in  0  i.  y  <  cd  , 


where  ^  *  y  is  the  excess  over  the  horizontal 

boundary  with  level  y.  In  light  of  the  inequality 
1  1  )  1  -  T|  for  "H  i  0  and  (2.5.16)  it  is  enough 

to  show  that  the  nonnegative  quantity 


(2.5.32) 


sup{E  R 

m,91(m)  y 


0  £  y  <  co  } 


0(<r) 

m 


as  m  — >  cd 


Indeed,  for  0  (  y  <  co  we  find 


(Em,91(m)Ry)  *  Em,01(m)Ry 
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*  T  )  /Em,ei(m)X 


using  theorem  3  in  Lorden  (1970).  An  easy  dominated  convergence 
argument  shows  that  E 


(X+)3  -»  E(X+)3  <  oo  ,  and 


"m.e^m) 

by  (2. 5. 19), (2. 5. 12), (2. 5. 14), (2. 5. 10),  and  (2.5.6) 


Sm,ei(m)X  ’  V<ei<*»  -  ^(§1(m))  -  €m 


(2.5.33) 


(i  +  o(i))51(m)  -  em  -  (l  +  °(D)em. 


Therefore 


sup{E 


m,61(m)  y 


R„:  0  <  y  <  oo}  £  (1  +  o(l)) £  E(X+)3/Cn)l/2 


(2.5.34) 


0(6  ~1/2)  “  o(6 J1). 
m  m 


which  proves  (2.5.32). 

The  remaining  assertion  is  (2.5.30).  Standardizing  to  zero  mean 
and  unit  variance, 


S  -  mE  .  ,  -X 
m  m,9.(m) 

P_  ~  .  ^{S„  £  <XJ  -  P„  Q  ,_J - J72  C  ~um} 


m»®l(n>)  “  ■“  W  v) 

(m) 


with 


(2.5.35)  u.  - 


1/2 
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From  (2.5.33)  and 

(2.5.36)  Varm>e^(m)X  -  ^“(e^m))  -  ^"(S^m))  -  iT^Cm))  ->  1 
and  (2.4.8)  follows 

0C 

(2.5.37)  u  ~^n(6  — --)• 

m  mm 

Let  G  be  the  standardized  distribution  of  (-X)  under 

m 

P  .  .  .  ,  and  let  X_  be  the  associated  Cramer  series. 
m,0^(m)  m 

Were  it  not  for  the  dependence  of  on  m,  direct  application  of 

Cramer's  result  (which  holds  as  well  when  >  is  changed  to  i  on 
the  left  in  (1.2.7))  would  give 

3 

(2-5-38)  p.,9l(.)(s.  ‘  <*„>  -  plz  > 

In  fact,  (2.5.38)  can  be  established  by  rehashing  the  proof  of 
Cramer's  theorem  and  used  to  deduce  (2.5.30).  We  shall  follow  a 
somewhat  shorter  route  and  verify  (2.5.30)  directly,  but  our  proof,  like 
Cramer's,  will  be  based  on  the  standard  large-deviation  technique  of 
exponential  tilting. 

2.6.  A  Cramer- like  result. 

In  this  section  we  complete  the  proof  of  Lemma  2.4.1  by 
establishing  the  CramSr-like  result  (2.5.30). 
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For  abbreviation  we  Introduce  the  notation 

a 

(2.6.1)  $(m)  ■  t  (m)/v£  ■  ^n(6  +— ). 

m  mm 

This  should  cause  no  confusion;  after  all,  whenever  Lemma  2.4.1  Is 
applied  to  the  original  convergence  rate  problem  the  Identification 

(2.6.1)  Is  made.  In  addition,  let 

(2.6.2)  z  -  z(igl) 

m  vtn 

with  z  defined  In  (2.1.1)  and  put 


(2.6.3)  ©2(m)  -  zm  +  0o(m); 

we  shall  soon  tilt  from  P  .  .  .  toPQ/.  to  compute 

m,9^(m)  m,92(m)  r 

the  left  side  of  (2.5.30).  Observe 

(2.6.4)  z  -  5^-  o, 

m  Jol 

a 

(2.6.5)  02(m)  -  +  o(€m)  0, 

(2.6.6)  dm(02(m))  »  K(zm)  -  K(5Q(m))  -  €m©2(m). 

Also  note  that 


<2-6-7)  Em,e  («)X  ■  C<92<*»  ■  *'<*„>  -  «. 


%l-e 

vm 


m 


am 


m 


and  that 


Var  ,  .X 
m,92(m) 


dm"(02(m))  -  K"(zm)  1. 


(2.6.8) 
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We  are  going  to  use  Che  Berry-Esseen  theorem,  and  so  the 
absolute  moments 


(2.6.9) 


•  m 


Vv-)1*1' 


E |X| 3  < 


oo 


will  arise;  dominated  convergence  gives 

(2.6.10)  (>m  ->  (>. 


Let 


(2.6.11)  W 


m 


pm,e  (o)(Sm 


‘  V 


denote  the  left  side  of  (2.5.30).  Putting  ©"  •  ©^(m), 

©"  ■  ©2(m),  and  n  ■  m  In  (2.5.22)  yields 

(2.6.12)  -  exp{-m[^m(©1(m) )  -  ^m(©2 (“))]} 

J(-a».a^J  exp[(01(m)-02(m))s]  ^ 

Recalling  (2.5.16)  and  (2.6.6)  and  simplifying, 


(2.6.13) 


ffo  “  exP{_mt®mzm  "  K^m)]} 


J(-oo  ,<X  ]  eXpt(9l(m)  “  90(n> 

zn J 

Pm,©2(m){Sm  9  ds}’ 


zm>al 


third 


©  ds) 


If  the  approximation  ft  to  n  is  obtained  from  the  right 

mm 
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side  of  (2.6.13)  by  replacing  P  a  ,  s{S  €ds)  with  the 

1/2  ^ 

normal  distribution  P{(X  +  m  or  Z  6  da}  with  the  same  mean 

m  tn 

and  variance,  then  (completing  the  square  and  rearranging) 


1»m  «  exp[(61(m)  -  90(m))ao] 


(2.6.14) 


*  exp[-|morm2(01(m)  -  02(m)f]  P{Z  £  -Jnor^e^m) 

*  exp{m[K(zm)  -  zm  •  ^]} 
exp[(01(m)  -  0o(m))(Xtn] 

*  exp[-  ^?2(m)  +  h(^trm(ei(m)  -  92(m)))] 


where 


1  2 


(2.6.15)  h(t)  -  log(e^  •  P{Z  >  t}). 


Note 


(2.6.16) 


h'(t) 


-it2 

t  -  JLe  2  /P{z  >  t> 


■JEW* 


-  — 1  / 1  as  t  ->  oo  . 


So  by  the  mean  value  theorem 


h(^Torm(0l(m)  -  02(m)))  -  h(^(«o  -  ~)) 


©2(n))} 
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°L  CX 

-  -(1  +  o(l))vEtam(01(m)  -  02(m))  -  (®m  -  -  ~)] 

(2.6.17)  -  o(l), 

and  hence  Che  second  of  the  Chree  factors  on  the  right  side  of  (2.6.14) 
Is  (1  +  o(l))  times 


estimate 
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(2.6.19) 


Iff  -  ft  |  C  2C  • 
m  m 


^m 

crjfi 


exp[(61(m)  -  e0(m))am] 


•  exp{m[K(zm)  -  zm  • 


where  C  is  the  universal  constant  appearing  In  the  Berry-Esien 

3  1/2 

bound  C  •  p/(<r  n  ')  on  the  error  in  the  central  limit 
theorem  (see  Feller,  1971,  p.  542).  Comparing  with  (2.6.14)  and 
recalling  (2.6.15),  the  right  side  of  (2.6.19)  is 


p 

(2.6.20)  2C  •  . .  •  ft  exp[-h(JRr  (9,  (m)  -  9,(m)))]. 

j  7—  m  mi 

or  Ja 
m 

But 

(2.6.21)  exp[-h(t)]  as  t  a>  , 

so  by  (2.6.8),  (2.6.10),  and  (2.6.21),  (2.6.20)  is 

a  +  0(D)  •  2c?^r(em  -  -  „(»„). 

Together  n  -  ft  and  (2.6.18)  give  (2.5.30),  completing 
m  m 

the  proof  of  Lemma  2.4.1.  [] 


CHAPTER  III 


STRASSEN  AND  SIEGMUND  REVISITED 


3.1.  Introduction. 

The  following  theorem  on  boundary  crossing  probabilities  for 
Brownian  motion  B  is  due  to  Strassen  (1965,  cf.  thm.  1.2). 

As  in  Chapter  2,  let  g:  (0,  oo  )  (0,  oo  )  and  write 


g(t)  -  vPl(t) 

THEOREM  3.1.1  (Strassen).  Suppose  that  g  is  continuously 
differentiable  with 


(3.1.1) 


t 


as  t  t  for  some  X  >  0.  Assume  as  in  (2.1.8)  that 

(3.1.2)  g'(u)  -  g'(t)  as  u  -  t  -»  a) 
and  that  g  6  U(B).  Then 

(3.1.3)  P{B(t)  >  g(t)  for  some  t  i  s}  -  Jg  as  s  oo  , 
where 
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(3.1.4) 


J 

s 


r°°  i. 


-  ^P2(t) 


dt. 


D 


Approximating  the  random  walk  S  by  B  using  Skorohod  embedding, 
Strassen  deduced  Theorem  1.4.1.  For  Skorohod  embedding  Strassen  was 
able  to  show  that 

(3.1.5)  Sn  -  B(n)  +  O(n^^(log  n)1^(log  log  n)1^)  a.s.  as  n  -»  oo  , 
but  this  bound  is  crude  enough  that  Strassen  was  forced  to  impose  the 

3/5  -  y 

restriction  g(t)  (  t  in  place  of  the  more  natural  (cf. 

2/3 

(1.2.9a))  g(t)  ■  o(t  ).  Furthermore,  as  explained  in  Remark 
1.4.2(b),  Strassen's  use  of  the  Skorohod  technique  is  flawed. 

A  response  to  both  criticisms  is  provided  by  the  approximation 
scheme  of  Komlos,  Major,  and  Tusnady  (1975;  1976).  These 
authors  used  techniques  strikingly  different  from  those  of  Skorohod  to 
obtain  a  better  approximation  of  B  to  S.  The  resulting  improvement 
(Komlos,  Major,  and  Tusnady,  1976,  thm.  1)  to  (3.1.5)  is 

(3.1.6)  Sq  *  B(n)  +  O(log  n)  a.s.  as  n  oo  , 

and  Komlos  et  al.  showed  that  (3.1.6)  is  the  best  possible  result  in 

this  direction.  In  Theorem  3.2.1  we  use  Theorem  3.1.1  and  the 

KomlSs  et  al.  approximation  to  give  a  correct  proof  of  Theorem 

2/3 

1.4.1,  widening  the  range  of  boundaries  to  g(t)  ■  o(t  ). 

Corollary  3.2.3  provides  a  neat  summary  of  Theorems  2.1.1  and  3.2.1. 
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In  Section  3.4  we  generalize  Theorem  1.5.1  to  linear  boundaries  g 
with  nonzero  intercept. 

3.2.  Strassen  revisited. 


THEOREM  3.2.1.  Adopt  all  the  assumptions  of  Section  2.1  preceding 
(2.1.6),  relaxing  (2.1.3)  to 

(3.2.1)  472  t 
t 

and  tightening  (2.1.4)  to 

(3.2.2)  4^3  *° 


and  (2.1.5)  to  (3.1.2).  If  g  6  0,  then 


(3.2.3)  p  ~J  as  m  — >  oo 
m  m 

> 

with  given  by  (3.1.4).  Q 

REMARK  3.2.2.  (a)  As  with  Theorems  2.1.1  and  3.1.1,  the  various 

regularity  conditions  imposed  on  g  need  only  hold  for  large  t. 


(b)  We  remark  without  proof  that  condition  (3.1.2)  can  be  relaxed 
to  (2.1.5).  One  must  then  assume,  however,  that 


(3.2.4) 


g(t) 

(ty) 


1/2 


t 


for  some  p  or,  more  generally,  that 
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LEMMA  3.2.4.  Under  the  assumptions  of  Theorem  2.1.1,  (3.2.8) 
holds. 


PROOF .  According  to  (2.3.6)  and  the  normal  tail  estimate  (1.2.8), 


P{Sffl  >  g(m) }  -  P { 2  >  T(m)} 


with  I  defined  in  (2.3.7).  It  follows  (of.  (2.3.2),  (2.3.3), 
(2.1.6),  (2.1.5),  and  Lemma  2.2.1(j))  that 


and  hence  from  Lemma  2.3.1  and  (2.3.9)  that 


J OsL 


pm  -  a  +  0(1»  p{?(o)  <  z  ‘  *<v«» 


-  (1  +  o(D)  STi“2.-  i.  e  ^  (t)  ?'(t)  dt 
g©P'(m)  J  m  «/2ff 


-  <i  +  oo» 


dt 


(3.2.9) 


(1  +  o(l))(J  -  J  ). 

m  v 

m 


But  using  Lemma  2.2.1(h)  we  find 


J* "  Jm 


/  ^2(t)  dt 


i  (i  + 


^  IriiF 


-  ^p(t) 


$'<c)  dt 
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if00!-  z 

-  <1  +  °(i»  (1+iF>Jmire  r(t)  dt 

-  (1  +  0(1))  (1  +  ^)  P{Z  >  ?(m)} 

-  (1  +  o(l))  (1  +  j£)  P{Sm  >  g(m) } 

and  similarly 

3  i  (1  +  o(l))2P{S  >  g(a)>. 

m  m 

Hence 

(i  +  o(D)  — *  Ja/pa  <  a  +  o(D)  j  (i  + 

1  +  is 

Thus 

(3.2.10)  o  <  Jv  <  (l  +  o(i))  y  a  +75-)  Pv  *  o(pm)  -  o(j  ) 

m  m 

and  (3.2.8)  follows  from  (3.2.9-10).  (] 

We  now  undertake  the  proof  of  Theorem  3.2.1.  Using  the 
approximation  of  Komlos  et  al.  we  shall  prove  in  Section  3.3  the 
following  strong  invariance  principle  to  be  used  in  conjunction  with 
Theorem  3.1.1. 

Extend  the  time  domain  of  S  to  [0,  co  )  via  the  definition 

(3.2.11)  S(t)  “  s|_t  I  .  0  £  t  <  co  . 
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LEMMA  3.2.5.  Adopt  the  assumptions  of  Theorem  3.2.1.  Choose 
(X  >  0  and  put 

(3.2.12)  h  -  ga. 

Then,  without  loss  of  generality,  there  exist  a  Brownian  motion  B  and 
positive  constants  K  and  M  such  that 

(3.2.13)  P{ I S(t)  -  B ( t ) I  >  h(t)  for  some  t  i  s}  £  Ke_Mh(a) 

for  all  s  >  0.  [] 

REMARK  3.2.6.  (a)  The  phrase  "without  loss  of  generality"  is  to  be 

interpreted  in  the  specific  sense  of  Strassen  (1965):  there  exists  a 
probability  space  on  which  S  and  B  are  both  defined  and  (3.2.13) 
holds.  The  proof  of  Theorem  3.2.1  then  applies  to  this  version  of  S, 
but  p^  -  P{Sn  ^  g(n)  f°r  some  n  i  m}  is  clearly  the  same  for  any 
version  of  S. 

(b)  The  constants  K  and  M  depend  on  F,  g,  and  (X.  Q 

In  proving  Theorem  3.2.1  we  apply  Lemma  3.2.5  with  (X  ■  1/2. 

First  observe  that 

Pm  ■  P{S(t)  >  g(t)  for  some  t  i  m}. 

Hence 

P{B(t)  >  g(t)  +  h(t)  for  some  t  1  m}  -  Ke 

£  Pm  i.  P { B ( t )  >  g(t)  -  h(t)  for  some  t  i  m}  +  Ke 
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1/2 

with  h  *  g  .  Now 

pm  i  P{Sm  >  g(m)}  -  (1  +  o(l))[^Wi(m)]"1exp[-  ^(m)] 
-  exp[-(l  +  o(l)>|l?2(m)] 

and  by  (3.2.2) 

?2(m)  -  o(h(m) ) , 


(1  +  o(l))P{B(t)  >  g(t)  +  h(t)  for  some  t  1  m} 


(3.2.14)  i.  p^  C  (1  +  o(l))P{B(t)  >  g(t)  -  h(t)  for  some  t  i  m}. 

To  complete  the  proof  of  Theorem  3.2.1  we'll  show  that  both  of  the 

Brownian  motion  probabilities  in  (3.2.14)  equal  (1  +  o(l))  J  . 

m 

Begin  with  the  right  side  of  (3.2.14).  Because  g  fi  0  and 
ft,  the  KPEF  test  (see  Section  1.3)  implies  that 


-  |?2(t) 


dt  <  oo  . 


Since  g  -  h  satisfies  hypothesis  (3.1.1)  of  Theorem  3.1.1  with 
X  *  1/2  and  by  (3.2.2) 
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Hc)g±.£!hi i.0(l). 

it  follows  from  the  KPEF  test  that  g  -  h  6  U.  Moreover,  g  -  h 
also  satisfies  (3.1.2)  because 

»'<*>  ‘  0)-  “(*'<'»• 

Thus  (3.1.3)  holds  for  g  -  h: 

P(B(t)  >  g(t)  -  h(t)  for  some  t  i  m} 

•  +  o(l»  J®  jfc  SrSt^'W  e*p{-  ^T(t)  - 

(3.2.15)  *  (1  +  o(l))J  as  m  co  . 

m 

As  for  the  left  side  of  (3.2.14),  we  note  that  g  e  U  and 

g  +  h  i  g  implies  g  +  h  6  U,  whether  U  equals  U(S)  or 

U(B).  The  function  g  +  h  also  satisfies  the  assumptions  of  Theorem 

3.1.1,  but  now  we  need  to  take  X  *  1/4.  Proceeding  just  as  with  the 

right  side,  we  find  the  left  side  of  (3.2.14)  to  be  (1  +  o(l))  J  , 

m 

and  the  proof  is  complete.  Q 
3.3.  A  strong  Invariance  principle. 

Using  the  standard  probability  estimate 

lB<t)  -  B<">1  >  «>  -  p<oSi  >  *> 

(3.3.1)  £  4P{B(1)  >  x} 

for  the  uniformity  of  B  over  an  interval  [n,  n  +  1]  of  length  one, 


dt 
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one  can  prove  without  difficulty  the  following  slight  extension  of 
theorem  1  in  Komlos,  Major,  and  Tusnady  (1976). 

LEMMA  3.3.1.  Let  S  satisfy  the  assumptions  of  Section  2.1, 
including  restriction  to  the  mgf  case.  Then,  without  loss  of 
generality,  there  exist  a  Brownian  motion  B  and  positive  constants 
a,  b,  c  depending  only  on  F  such  that 

(3.3.2)  P(0J3s  I  S( t)  -  B(  t)  |  >  x}  <  asbe~cx 
for  all  x  >  0  and  s  i  1.  Q 

With  this  result  we  are  now  ready  for  the 

PROOF  of  Lemma  3.2.5.  We  may  suppose  that  s  >  1  and  define 
(  i  0  from  s  by 

(3.3.3)  2^<s(2^+1. 

Then  by  Lemma  3.3.1  and  the  increasingness  of  h 

p(s)  =  P { I S( t )  -  B( t) I  >  h(t)  for  some  t  i  s} 

£  P(Ujijf  { I S(t)  -  B( t)  I  >  h(t)  for  some  2j  <  t  £  2J+1}) 

£  I*.,  P{  3UP  IS(t)  -  B(t) I  >  h(2J)} 

J  ^  0iti2J  1 

*  al  expfb^ j  -  c1h(2-')] 

with  a^  »  a  *  2b,  b^  ■  b  •  log  2,  and  - 


c.  According 


to  (3.2.1).  h(2J)  i  h(l)  •  2  J  for  all  j  i  0,  and  so 

(3.3.4)  implies  that  for  c2  -  c^/2  and  some  a2  >0 

p(s)  ^  a2  exp[-c2h(2^)] 

£  a2  exp[-c2h(2t)]  dt 

-  i^fr  U_1  «pt-c2h(u)1  du* 

Now  by  (3.2.12)  and  the  fact  g'(u)  i  g(u)/(2u)  (cf.  Lemma  2.2.1(b)) 

h'(u)  -  (Xh(u)|^-  >  \  a 

hence  with  a 3  -  (2a2)/((Xc2h(l/2)log  2}  >  0  we  find 
p ( s )  £  a^  J  exp[-c2h(u)]  c2  h'(u)  du 


a2exp[-c2h(2^  ^)] 


i  a3exp[-c2h(-js)]  • 

_ 20C/  3 

(3.2.13)  now  follows  with  K  -  a3  and  M  -  4  c2  from 
(3.2.2).  0 
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3.4.  Siegmund  revisited. 

In  solving  the  convergence  rate  problem  in  the  SLLN  case  (see 
Section  1.5),  Siegmund  (1975)  treated  linear  boundaries  g(t)  •  Ct 
that  pass  through  the  origin  and  have  positive  slope  C.  Our  proof  of 
Theorem  2.1.1  used  Siegmund' s  techniques  to  establish  the  key  Lemma 
2.4.1,  which  estimates  crossing  probabilities  for  boundaries 
jfffl(t)  ■  +  €mt  having  variable  intercept  and  slope 

*  o(l).  The  provability  of  Lemma  2.4.1  leads  to  the  conjecture  that 
Siegmund's  result  can  be  extended  to  the  case  g(t)  *  OC  +  Ct.  The 
purpose  of  this  section  is  to  state  such  an  extension,  Siegmund's  proof 
adapting  easily  to  our  generalization. 

Accordingly,  let  C  >  0  and  (X  0  R  be  given,  and  denote 
by  g  the  straight  line 

(3.4.1)  g(t)  -  a  +  Ct. 

Our  goal  is  to  determine  the  asymptotic  rate  at  which  the  boundary 
crossing  probabilities 

(3.4.2)  p^  ■  P{Sn  >  g(n)  for  some  n  i  m} 

converge  to  zero  as  m  — >  oo  when  S  satisfies  the  assumptions  of 
Section  1.1. 

We  need  some  notation.  Define 


Ke<5>  -  K<5)  - 
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for  $  in  the  interval  of  finiteness  of  K.  As  at  the  start  of  the 

proof  of  Lemma  2.4.1  it  follows  that  if  £  is  small  enough  then  there 

exists  a  unique  nonzero  value  ,  necessarily  positive,  for  which 

Kg(J^)  -  0.  Assume  that  such  a  value  5^  exists  —  this 

is  the  criterion  of  "smallness"  for  £  mentioned  in  Section  1.5  — 

and  let  $g  denote  the  point  in  (0,  J^)  at  which 

Kg'  vanishes.  In  the  notation  of  (2.1.1),  5q  *  z(€). 

As  (2.5.17)  and  (2.5.11)  defined  P  n,  so  let  ?n  denote  the 

m,u  u 

probability  under  which  X^,  X2,  ...  are  i.i.d.  with 

(3.4.3)  PQ{X  e  dx}  -  exp(5Qx  -  Kg(50)]  F(6  +  dx); 

PQ  has  cgf 

(3.4.4)  tf(6)  -  Kg(50  +  0)  -  Kc(50), 
mean 

(3.4.5)  <T(0)  -  Kg'(50)  -  0, 
and  variance 

(3.4.6)  or2  =  «T( 0)  -  Kg"(50)  -  K"(5Q). 

Put 

(3.4.7)  90  -  -5q  <  0; 


then  (cf.  (2.1.1)) 
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(3.4.8)  rf<eQ)  -  -Kc<50)  “  “  «3X(6). 

We  further  assume  that  either 

(3.4.9)  F  is  non-lattice 

or 

(3.4.10)  F(C  +  •)  Is  a  lattice  distribution  supported  by 

{0,  ±h,  ±2h,  ...}  and  with  span  h  >  0,  and  0C  *  hk  for 
some  (positive,  negative,  or  zero)  integer  k. 

In  Siegmund's  paper  the  necessary  (see  the  proof  of  his  lemma  1) 
assumption  that  h  is  the  spaa  of  F(6  +  •)  (rather  than  just  a 
multiple  thereof)  is  tacit.  The  distinction  is  not  merely  academic,  for 
if,  for  example,  F  is  a  lattice  distribution  supported  by  the 
multiples  of  h  and  with  span  h,  then  (3.4.10)  implies  S  1  h. 

For  another  illustration,  in  the  case  of  symmetric  Bernoulli  components 
(  X  •  ±1  with  probability  1/2  each),  Siegmund's  theorem  and  the 
present  generalization  are  vacuuous(i),  for  the  smallest  possible  value 
6*1  still  does  not  satisfy  the  above  criterion  of  smallness. 

Generalizing,  and  simplifying  the  statement  of,  theorem  1  in 
Siegmund  (1975)  we  have 

THEOREM  3.4.1.  Using  the  preceding  assumptions  and  notation,  as 


m  ->  oo 
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(3.4.11)  pm  -  exp{^“1  n'1  [e^Vp^  >  0}  -  P{Sn  >  €n}]} 

•  P{S  >  g(m) } 
m 


and  furthermore 


(3.4.12)  P{Sm  >  g(m) }  -  C(2ffcr2m)"1/2exp(-m^(e0)  +  0O(X), 
where  if  (3.4.9)  holds 


(3.4.13)  C  -  1 / I ©o I 
and  if  (3.4.10)  holds 


(3.4.14)  C  -  h/[exp(|eolh)  -  1].  0 


Notice  that  the  only  factor  in  (3.4.11-12)  depending  on  0C 


exp(eQ00 . 


CHAPTER  IV 


TWO  REMARKS 

4.1.  Introduction. 

In  Section  1.4  we  stated  that  Strassen  (1965)  determined  the 
probabilities 

Pm  ■  P{Sn  >  g(n)  for  some  n  >  m} 

up  to  a  factor  (1  +  o(l))  under  certain  conditions  on  g.  In  fact, 
Strassen's  theorem  1.4  specifies  the  convergence  rate  for  the 
probabilities 

P{Sn  i  g(n)  for  some  n  >  m}. 

The  results  are  the  same:  in  Section  4.2  we  show  that  under  the 
assumptions  of  Corollary  3.2.3  the  four  probabilities 

(4.1.1)  p  ■  P{S  >  g(n)  for  some  n  i  m}, 

m  n 

(4.1.2)  p  . ,  *  P{S  >  g(n)  for  some  n  >  m}, 

nrri  n 

(4.1.3)  p*  ■  P{Sq  i  g(n)  for  some  n  i  m}, 

(4.1.4)  p*  -  P{S  i  g(n)  for  some  n  >  m} 

art-i  n 
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are  all  asymptotically  equivalent,  i.e.,  equal  up  to  a  factor 

(1  +  o(l)),  as  m  oo  .  Section  4.2  also  contains  the  noteworthy  result 

that,  under  the  assumptions  of  Theorem  3.4.1,  p^  and  ^  are 

not  asymptotically  equivalent;  furthermore,  that  p*  -  p^ 

only  in  the  non-lattice  case  (3.4.9). 

Section  4.3  is  concerned  with  an  entirely  different  matter.  Let 
S  satisfy  the  assumptions  of  Section  1.1  and  let  g  be  a  boundary 
belonging  to  both  U(S)  and  U(-S).  According  to  the  definition  of 
upper  class  boundaries  the  probabilities  (4.1.1)  and 

Pm  E  P{Sq  <  -g(n)  for  some  n  i  m} 

both  vanish  as  m  oo  .  The  double-boundary  crossing  probability 

(4.1.5)  P { I S n I  >  g(n)  for  some  n  i  m} 

is  majorized  by  p  +  p  and  so  is  also  small  when  m  is  large, 

m  m 

One  expects  that  the  probability  of  the  event  that  S  crosses  one  of 
the  boundaries  ±g  after  time  m  and  then  alters  its  course  so 
drastically  as  to  cross  the  other  is  of  smaller  order  of  magnitude  than 

(4.1.5) .  This  is  indeed  the  case,  as  the  following  theorem,  proved  in 
Section  4.3,  demonstrates: 

THEOREM  4.1.1.  Let  b  e  U(S),  a  e  U(-S),  with  a  f, 

P{Sn  >  b(n)  or  Sn  <  -a(n)  for  some  n  i  m} 


b  t •  Then 
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(4.1.6)  -  P{Sn  ^  k(n)  for  some  n  i  m} 

+  P{S  <  -a(n)  for  some  n  >  m).  Q 
n 

Section  4.3  also  discusses  some  consequences  of  Theorem  4.1.1  when 
a  -  b  -  g. 


4.2.  >  versus  i  . 

Suppose,  as  In  the  hypotheses  of  Corollary  3.2.3,  that  the 
1  /2 

boundary  g(t)  *  t  *f(t)  satisfies  the  growth  conditions 

(4.2.1)  t  oj  ,  I  0 

as  t  t  03 .  Cramer's  result  (1.2.7)  can  be  stated  in  the  form 

(4.2.2)  P{Sm  >  g(m) }  -  P{Z  >  f(m)}  as  m  03 

with  defined  by  (2.3.7).  Now  as  t  co  , 

'f(t)  -  f(t)  oo  and,  by  a  minor  adjustment  to  the  proof 

of  (2.3.10),  f2(t  +  1)  -  f2(t)  0.  It  follows 

using  the  normal  tail  estimate  (1.2.8)  and  Lemma  2.2.1(h)  (whose  proof 
requires  only  (4.2.1))  that  as  m  ->  ao 

(4.2.3)  PfS^  >  g(m+l ) }  -  P{Sffl  >  g(m) } . 

The  strong-law  counterpart 

(4.2.4)  p^  -  pm 

to  (4.2.3)  is  established  using  the  result 
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(4.2.5) 


m 


J 

m 


Too  i 

J  m 


g^t) 

Jt 


-  ^f2(t) 


dt 


of  Corollary  3.2.3.  Indeed  it  is  enough  to  recall  that 
f2(t  +  1)  -  ?2(t)  -¥  0  and  to  observe  from  (3.1.2) 

that  g'(t  +  1)  -  g'(t)  as  t  oo  . 


We  next  consider  boundary  crossing  probabilities  in  which  the 
definition  of  the  event  {  S  crosses  g  at  time  n  }  is  changed  from 
the  strict  inequality  {Sn  >  g(n)}  to  {Sn  >  g(n)}.  If  the 
component  distribution  F  is  continuous,  i.e.,  has  no  point  masses, 
then  clearly  i  g(m)}  -  P{Sm  >  g(m)}  and  p*  -,Pm 

for  every  m,  irrespective  of  the  properties  of  g.  We  now  show  that, 
regardless  of  the  form  of  F,  these  equalities  hold  asymptotically  as 
m  ->  ao  ,  i.e. , 

(4.2.6)  P{S  i  g(m) }  -  P{S  >  g(m) } 

UA  m 

and 


assuming  the  hypotheses  of  Cramir's  theorem  (1.2.7)  and  Corollary 
3.2.3,  respectively. 


The  truth  of  (4.2.6)  is  not  surprising,  as  (4.2.2)  suggests  the 
result 

P{Sm  i  g(m) }  -  P{Z  i  f (m) } 

and  Z  has  a  continuous  distribution.  A  straight-forward  way  of 
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proving  (4.2.6)  is  to  note  that  the  proof  of  Cramgr's  theorem 
carries  through  as  well  for  the  non-strict  crossing  probabilities 
P{Sm  i  g(m)}  as  for  P{Sn  >  g(m)}.  An  alternative  technique,  to  be 
used  also  in  deducing  (4.2.7)  when  Theorem  2.1.1  is  in  force,  is  to 
bound  {S^  i  g(m)}  according  to 

(4.2.8)  {Sm  >  g(m) }  C  iSm  i  g(m)}  C  {Sm  >  g"(m)}, 

where  g  minori2es,  but  closely  approximates,  g.  A  suitable  choice 
for  g  is 

(4.2.9)  g-(t)  -  [1  -  l/f3(t)]  •  g(t). 

Clearly  f“(t)  -  ?(t)  cn  as  t  a>  ,  where 

g  (t)  5  t1^2^  (t).  Moreover,  with  f  given  by  (2.4.23) 
and  f  (t)  =  [tf(?  (t)/t1/2)]1/2  in  accordance 
with  (2.3.7), 

?2(t)  -  (f'(t))2  -  t[f(^|i)  -  f(^^-)] 


-  (1  +  o(l))t 


4— xH- 


(t) 

X" 


"(t)  V 

IT" > 


-  (1  +  0(D)  •  2*(t)(¥(t)  -  r(O) 


-  (1  +  0(D)  •  2/f(t)  -  Od) 


as  t  oo  .  From  CramSr's  theorem  it  follows  that 
(4.2.10)  P{Sm  >  g’(rn)}  -  P{Sm  >  g(m) } ; 


in  light  of  (4.2.8),  (4.2.6)  holds. 
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Turning  to  (4.2.7),  we  consider  separately  the  two  results. 
Theorems  3.2.1  and  2.1.1,  from  which  Corollary  3.2.3  follows.  For 
Theorem  3.2.1  we  rehash  the  proof;  for  Theorem  2.1.1  this  method  is 
direct  but  laborious  and  we  choose  to  employ  instead  the  approach  of 
(4. 2. 8-9). 


Theorem  3.2.1  is  derived  from  the  corresponding  result,  Theorem 
3.1.1,  for  Brownian  motion.  But  as  a  consequence  of  Ylvisaker's  (1968) 
theorem. 


P{B(t)  i  g(t)  for  some  t  i  s) 

(4.2.11)  -  P{B(t)  >  g(t)  for  some  t  i  s} 

for  every  s,  so  Theorem  3.2.1  for  p+  follows  in  the  same  way 
as  for  p^,  and  (4.2.7)  holds. 


Routine  calculations  show  that  the  function  g  of  (4.2.9) 
satisfies  the  hypotheses  of  Theorem  2.1.1,  with  the  possible  exception 
of  the  monotonicity  of  g  (t)/t,  and  that  (g  )'(t)  -  g'(t)  and 
(V  )'(t)  -  ?'(t)  as  t  oo  .  In  addition,  Lemma  2.2.1 
continues  to  hold  true  —  some  parts  in  negligibly  weaker  form,  such  as 
(g  )'(t)  £  (1  +  o(l))g  (t)/t  for  the  second  half  of  (b)  —  and  it 
is  not  hard  to  see  that  the  conclusion  (2.1.6)  holds  for  g  .  Thus  by 
(4.2.8)  and  (4.2.10), 


Pm  ^  (1  +  0(D)  Is  p{s  >  g”(m) ] 

“  m  vteCP  )'(m)  m 
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-  (1  +  o(D)  f^-m) ,  P{Sm  >  g(m)} 

(m)  m 

-  (1  +  o(D)Im  -  (1  +  o(D)pm, 

and  (4.2.7)  holds. 

Summarizing,  when  the  assumptions  of  Corollary  3.2.3  are  in  force, 
the  four  weak-law  probabilities 

(4.2.12)  P{Sm>g(m)},  PtS^  >  g(m  +  1)}, 

0 

P{Sm  i  g(m)},  PfS^i^+l)} 

are  all  asymptotically  equivalent,  as  are  (4. 1.1-4). 

The  results  are  quite  different  when  the  hypotheses  of  Theorem 
3.4.1  are  assumed.  In  particular,  it  is  immediately  clear  from 
(3.4.11-12)  that  (4. 2. 3-4)  fail  in  this  case.  Furthermore,  (4. 2. 6-7) 
are  true  if  (3.4.9)  holds  but  false  if  (3.4.10)  holds.  This  can  be 
verified  by  reviewing  the  proof  of  Theorem  3.4.1,  but  we'll  introduce 
minorizing  approximants  to  g  and  proceed  as  in  (4.2.8). 

Assume  first  that  (3.4.9)  holds.  For  S  >  0  let 

gg(t)  »  OC  -  5  +  €t. 

Apply  (3.4.12)  to  both  g  and  gg  and  use  (4.2.8)  (with 
8  *  gg  )  to  conclude 

(P{S  }  g(m)}/P{S  >  g(m)}) 

in  m 


^  ^  lim  inf 
m  — ^  cd 
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i  <P{Sm  *  >  g(B>»  <  exp(|e0!S). 

Now  let  6  4  0  to  get  (4.2.6).  Relation  (4.2.7)  is  proved  in 

the  same  way. 

Approximations  gg  to  g  with  ever  increasing  precision  are 
not  available  in  the  lattice  cast  (3.4.10),  due  to  the  restriction  of 
OC  to  integer  multiples  of  h  ■  (span  of  F  ).  In  fact, 

(4.2.13)  P{Sm  }  g(m) }  -  P{Sm  >  OC  -  h  +  6m}  -  exp( |eQ | h)P{Sm  >  g(m)} 
and  similarly 

(4.2.14)  p+  «  P{S  >  OC  -  h  +  fin  for  some  n  i  m}  -  exp(i9_|h)p  . 

m  n  u  m 

4.3.  Two  boundaries. 

The  first  task  of  this  section  is  the 

PROOF  of  Theorem  4.1.1.  The  right  side  of  (4.1.6)  overestimates 
the  left  by 


P{Sn  >  b(n^)  for  some  n^  i  m,  Sn  <  -a(n2)  for  some  ^  i  m} 


(4.3.1)  £  P{Sp  >  b(p),  Sq  <  -a(q) 

for  some  p,  q  satisfying  q  >  p  i  m} 

+  P{Sp  <  -a(p),  Sq  >  b(q) 

for  some  p,  q  satisfying  q  >  p  i  m}. 

Let  us  examine  the  first  term,  which  equals 
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(4.3.2)  -^n  HSn  £  b(n)  for  m  £  n  <  p,  Sp  >  b(p), 

for  some  q  >  p}. 


The  summand  here  is 

P{Sn  £  b(n)  for  m  £  n  <  p,  Sp  >  b(p)} 

times 

P{S^  <  -a(q)  for  some  q  >  p  I  Sr  £  b(n)  for 

Sp  >  b(p) } 

£  P{S^  -  Sp  <  -(a(q)  +  b(p))  for  some 

q  >  p  I  S^  £  b(n)  for  m  £  n  <  p,  Sp 

■  P{S^  -  Sp  <  -(a(q)  +  b(p))  for  some  q  > 

■  P{-Sk  >  a(k  +  p)  +  b(p)  for  some  k  i  1}; 
the  sum  (4.3.2)  is  therefore  majorized  by 

P{S  £  b(n)  for  m  £  n  <  p,  S  >  b(p)} 
~p*m  n  P 

■  P { S  >  b(n)  for  some  n  i  m} 

n 

times 

(4.3.3)  Pf-S^  ^  a(k)  +  b(m)  for  some  k  i  1}. 


q  < 


m  £  n  <  p, 


b(  p)  } 


We  shall  show  that  the  probabilities  (4.3.3)  decrease  to  0  as 
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m  f  cd  ,  so  that 

(first  term  on  right  in  (4.3.1)) 

/ 

(4.3.4)  •  o(first  term  on  right  in  (4.1.6)). 

In  just  the  same  way 

(second  term  on  right  in  (4.3.1)) 

(4.3.5)  ■  o(second  term  on  right  in  (4.1.6)) 

and  the  proof  of  (4.1.6)  is  complete. 

Since  b(m)  f  oo  as  m  f  oo  (as  follows  from 
b  6  U(S)  ),  the  events  in  (4.3.3)  decrease  to 

(4.3.6)  {-S^  ^  a(k)  +  n  f°r  some  k  i  1} 

Because  a  e  U(-S), 

(4.3.7)  P(U^^  ^_Sk  ^  aW  for  a^-1  ^  >  k*})  ■  1. 

Hence  the  event  (4.3.6)  has  the  same  probability  as 


^k*-l  ^n-l  ^~^k  ^  a^^  +  n  ^or  30me  1  £  k  £  k*} 


U£-l  ~n«l  <iXk*  <‘V  >  “  *- 


namely,  zero.  Q 
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One  might  ask  how  the  two  terms  on  the  right  In  (4.1.6)  compare  In 
size.  We  shall  answer  this  question  assuming  that  a  ■  b  ■  g  satisfies 
the  hypotheses  of  Corollary  3.2.3,  l.e.,  of  either  Theorem  3.2.1  or 
Theorem  2.1.1. 


Recall  that  Theorem  3.2.1  is  an  invariance  principle:  the 
asymptotic  rate  at  which  the  probabilities 

Pm  ■  ptsn  ^  8(n)  for  some  n  i  m) 

decrease  to  zero  is  the  same  for  any  random  walk,  including  (-S).  Thus 
each  term  on  the  right  in  (4.1.6)  equals  (1  +  o(l))  as  m  oo  , 
where  J#  is  defined  by  (3.1.4). 

In  the  case  of  Theorem  2.1.1  we  need  only  to  examine  the 
corresponding  weak-law  probabilities  using  (1.2.7): 

(4.3.8)  p(sm  ^  g(®)}  ~  p{Z  >  fKn)}expf^(m)tyj^X(tyj^)) , 


(4.3.9)  Pf-S^  ^  “  P(Z  >  1p(m) }exp[-ty2(m) ~j[~~X(~  ^£~) ] , 

(4.3.9)  holding  since  (-X)  has  CramSr  series  (— X(~( ')))•  The 
ratio  (4.3-8)/ (4.3.9)  is 


(4.3.10)  rm  i  exp{l?2(m)Hf-[X(Hgl)  +  \(-  %^)]}- 


In  light  of  (2.1.6)  r  is  also  the  ratio  of  the  first  term  on  the 

m 

right  in  (4.1.6)  to  the  second  when  a  M  b  »  g  satisfies  the  hypotheses 


of  Theorem  2.1.1. 
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If  t(m)  -  o(m^^),  i.e.,  if  g(m)  ■  then 

r  ->  1,  in  accordance  with  ouf  investigation  for  Theorem  3.2.1. 

m 

3 

Likewise,  provided  the  constant  term  Xq  *  EX  /6  in  the 
CramSr  series  X  vanishes  (in  particular,  if  X  has  a  symmetric 
distribution  function  F  ),  then 


(4.3.11) 


-  exp{2X2 


?5(m)1 
3/2  * 


tends  to  unity  whenever  f(m)  »  o(m3/,1()). 

Suppose  Xq  i  0  and  m1//6  -o(?(m)).  Then 
CD  > 

(4.3.12)  r  according  as  Xo  0. 

m  0  ^  < 

The  result  (4.3.12)  agrees  with  intuition:  If  the  distribution  of  the 

component  X  has  a  long  right-hand  tail,  as  is  typically  the  case  when 

n 

EX  ■  0,  Var  X  *  1,  and  EX  >  0,  large  positive  deviations  for  S 
are  more  likely  than  large  negative  deviations. 

We  can  refine  (4.3.11-12).  For  example,  if  ?(m)  ■  o(m^^^), 

then 


(4.3.13) 


m 


exp { 2  Xq 


y3(m) 


+  0( 


f3(m) 

3/2 


)}, 
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5/14. 


regardless  of  the  sign  of  Xq.  If  f(n>)  *  o(m  ) 

......  ?3(m)  .  ..  |5(m)  .  nf?7(.)„ 

(4.3. !4)  -  exp {2^-^— +  2X2-372“  + 


and  so  on. 


then 


CHAPTER  V 


ASYMPTOTIC  EXPANSIONS 

5.1.  Introduction. 

As  defined  in  Section  1.2,  the  problem  of  convergence  rates  related 
to  the  WLLN  is  to  determine  the  probabilities 

(5.1.1)  P(Sm>g(m)} 

up  to  a  factor  (1  +  o(l))  as  m  oo  when  the  random  walk  S 
satisfies  the  assumptions  of  Section  1.1  and  the  boundary  g  satisfies 

(5.1.2)  f(m)  -  oo 

as  m  qo  .  A  solution  to  this  problem  is  by  definition  an 

approximation  to  (5.1.1)  whose  relative  error  tends  to  zero  as  m  a> . 

We  can  guarantee  that  this  first-order  approximation  is  of  a  specified 

quality  by  determining  an  asymptotic  upper  bound  on  the  rate  at  which 

the  relative  error  vanishes.  The  guaranteed  accuracy  of  a  second-order 

approximation  whose  relative  error  is  known  to  vanish  more  quickly  than 

this  bound  is  greater.  Iterating  this  idea  we  arrive  at  the 

problem  of  asymptotic  expansions  related  to  the  WLLN:  to  develop  an 

asymptotic  expansion  for  (5.1.1)  when  (5.1.2)  obtains.  In  Section  5.2 

1/2 

we  present  a  solution  in  the  Cramgr  case  ?(m)  *  o(m  '  ), 
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1/2 

f  (m)  ->  od  .  The  solution  in  the  WLLN  case  ?(m)  «  cm 

is  due  to  Bahadur  and  Ranga  Rao  (1960)  and  is  omitted  here. 

The  problem  of  asymptotic  expansions  related  to  the  SLLN  is  to 
derive  an  asymptotic  expansion  for  the  probabilities 

(5.1.3)  p  -  P{ S  >  g(n)  for  some  n  i  m} 
m  n 

when  g  6  U(S).  The  dominant  term  in  such  an  expansion  is  given  for 

1  /2 

boundaries  $(t)  “  g(t)/t  of  slow,  moderate,  and  rapid  growth 
in  Theorems  3.2.1,  2.1.1,  and  3.4.1,  respectively,  the  slow  and  moderate 
ranges  overlapping  considerably.  In  Section 

5.3  we  obtain  a  partial  asymptotic  expansion  of  p^  for  boundaries  of 
slow  growth.  This  expansion  is  of  higher  order  the  more  slowly  $ 
increases  and  in  fact  is  complete,  i.e.,  of  infinite  order,  when  $ 
is  not  also  in  the  range  of  moderate  growth.  In  Section  5.4  we  derive 
an  asymptotic  upper  bound  on  the  relative  error  in  the  approximation  of 
Theorem  2.1.1  for  boundaries  of  moderate  growth.  The  problems  of 
finding  even  the  second  term  in  an  asymptotic  expansion  of  p^  for 
f  in  the  moderate  range  and  of  bounding  the  error  in  the 
approximation  of  Theorem  3.4.1  remain  open. 

5.2.  An  asymptotic  expansion  related  to  the  WLLN. 

Throughout  this  section  we  assume  (5.1.2). 
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We  begin  with  an  examination  of  the  case  F  ■  <£  of  standard 
normal  components.  In  this  case  the  probability  (5.1.1)  equals 
P{Z  >  $(m)},  and  the  solution  to  the  convergence  rate  problem  is  given 
most  simply  by  (1.2.8).  The  solution  to  the  asymptotic  expansion 
problem  is  equally  well  known  (Feller,  1968,  p.  193):  for  any  n  i  0 


(5.2.1)  P{Z  >  t(m) } 


[*5n?(m)j  1exp[-  ifZ(m)] 


lm2, 


[1  - 


1  1*3  _  1-3-5 

?2(m)  f4(m)  ?^(m) 


+  (-D 


n  1-3- 


f2tl(m) 


f2n(m) 


)] 


as  m  — >  ao  . 

For  general  F  the  solution  presented  here  to  the  asymptotic 
expansion  problem  in  the  Cramer  case 

(5.2.2)  HT(m)  -  o(v£) 

is  due  to  Saulis  (1969);  see  also  Ibragimov  and  Petrov  (1971,  p.  416). 

In  addition  to  the  assumptions  of  Section  1.1,  suppose  that  F  has 
characteristic  function  Eexp(itX)  satisfying 

(5.2.3)  |EeltX|  <  1> 

the  so-called  CramSr's  condition  (C).  The  condition  (5.2.3)  holds, 
in  particular,  if  F  has  a  nonzero  absolutely  continuous  component  and 
fails  if  F  is  a  lattice  distribution.  Then  for  any  n  i  0 


87 


(5.2.4)  P{Sm  >  g(m) }  -  P{Z  >  f (m) }  •  exp[?2(m)I^X(^-) ] 

'  [£0  Qk(»)  + 


as  m  ->  ao  ,  where  the  functions  Qq,  Q^,  ...  satisfy 
(5.2.5)  Qk(»)  =■  0((^-)k),  k)0. 

Saulis  (1969)  gives  explicit  formulas  for  the  functions  Qfc.  In 
particular , 


(5.2.6a)  QQ(m)  *  1* 

reducing  (5.2.4)  to  (1.2.7)  in  the  case  n  *  0.  For  n  »  1  the 
approximation  (5.2.4)  fares  better  than  is  guaranteed  by  (5.2.5): 
Q^(m)  SO  if  Xq  *  0  and 


Q1(m) 


^0  #  ^f2(m)  -  1  e-  |f2(m) 

Jn  ^F?{Z  >  t(m)} 


-  f3(m) ] 


(5.2.6b)  -  -(1  +  0(1))  3Xq  — \ ~= 

v  f(m )Ja 


otherwise. 


In  the  language  of  asymptotic  series,  Qk(m)  is  an 
asymptotic  expansion  for 

P{Sm  >  g(m) } 

P{Z  >  f(m)}  •  „pIf2C)^?-X(^)T 

Vtn  vm 


as  m  4  oo  with  respect  to  the  auxiliary  asymptotic  sequence 
(('>(m)/m1//^)tl)> 
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The  normal  probability  on  the  right  in  (5.2.4)  can  be  expanded 
using  (5.2.1).  The  power  series  representation  of  \  yields  an 
asymptotic  expansion  for  the  argument  of  the  exponential  factor  on  the 
right  in  (5.2.4).  An  expansion  for  that  factor  can  then  be  obtained  if 
g(m)  *  o(m^  ^)  for  some  TJ  >  0  by  employing  the  Taylor's 
series  for  the  exponential  function. 

Recall  from  (1.2.9a)  that  the  probabilities  P{Sm  >  g(m)}  and 

P{Z  >  $(m)}  agree  up  to  a  factor  (1  +  o(l))  whenever 
2/3 

g(m)  ■  o(m  ).  More  generally,  as  seen  below,  if 

(5.2.7)  g(m)  -  ©(m1 
with 

(5.2.8)  j  <  Tf  <  j, 

then  the  invariance  principle 

(5.2.9)  PfS^  ^  8(®)}  ■  (right  side  of  (5.2.1))  as  m  cd 
obtains,  provided 


(5.2.10) 


n  .  3  7?  -  1/3 
*  2  1/2  -  TJ 


With  TJ  ■  1/3,  as  in  (1.2.9a),  the  bound  on  n  is  0,  and  (5.2.9) 
reduces  to  (1.2.9a).  As  3}  increases  to  1/2,  the  bound  on  n 
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increases  to  oo  .  The  inequality  (5.2.10)  can  be  inverted: 

(5.2.10a)  1\  It 

For  example,  to  apply  (5.2.9)  with  n  *  1  we  must  have  i  2/5, 
i.e.,  g(m)  »  o(m3^3).  For  n  ■  2  we  require  g(m)  “  o(m^2), 
and  so  on.  If  (5.2.7)  holds  for  every  <  1/2,  e.g.,  if 

(5.2.11)  g(m)  -  jm  •  (log  m)P,  p  >  0, 

then  (5.2.9)  provides  a  complete  asymptotic  expansion  for  m:  (5.2.9) 
holds  for  each  n  1  0. 


The  proof  of  (5.2.9)  from  (5.2.10)  is  easy.  Since 
g(m)  ■  o(m2^),  i.e.,  ?(m)  ■  o(mly^),  (5.2.4)  with  n  -  1 

yields 

P{Sm  >  g(m) }  -  P{Z  >  f (m) }  •  [1  +  O(^Sl)]  •  [1  +  O(Hf-)] 

3 

-  p{z  > »(»)}  •  [i  +  0(54^-)] 

vm 

using  (5.1.2).  So  (5.2.9)  holds  if 


?3(m) 

Ji n 


o( 


f 2n(m) ' 


i.e.,  if 


n  +  2 

.  .  .  2n  +  3. 

g(m)  -  o(m  ). 


*  f 
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For  this  it  is  enough  that  1  -  £  (n  +  2)/(2n  +  3),  i.e.,  that 

(5.2.10a)  hold. 


5.3.  An  asymptotic  expansion  related  to  the  SLLN. 


The  following  theorem  (Wichura,  1980)  gives  a  complete  expansion 
for  boundary  crossing  probabilities  for  Brownian  motion. 


THEOREM  5.3.1  (Wichura).  Suppose  the  oositive  function 
g  6  U(B)  satisfies  the  growth  condition 


(5.3.1) 


as  t  t  for  some  X  >  0.  Let  n  1  0,  and  suppose  g  has 

(n  +  1)  continuous  derivatives  satisfying  the  smoothness  condition 

(5.3.2)  g^k\u)  -  g^(t)  as  u-t-^oo,  1  £  k  £  n  +  1, 

and  the  growth  condition 


(5.3.3)  g(k)(t)  =  t1/2  ‘  kf(t),  l  *  k  <  n  +  1. 
Finally,  suppose 

(5.3.4)  P{B(t)  >  g(t)  for  some  t  i  s}  ■  o(— - - ) 

f2n(s) 

as  s  od  .  Then  there  exist  functions  Aq,  A^ ,  ... 

satisfying 

(5.3.5)  \<t)  -  0( - tt-— - ),  0  <  k  t  n, 

VB  '  (t) 
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as  t  ->  as  such  that  with 


(5.3.6) 


Jk(s) 


r°°  i  i 

J  s  2^2 TTt 


-  ^?2(t) 


\(t)  dt 


we  have 


(5.3.7) 


P{B(t)  >  g(t)  for  some  t  i  s} 

poo  i  " 


S-0  Jk(s>  +  0(J  s  X  6 


XV2n  "  x(t) 


dt) 


as  s  ->  qo  .  (] 

REMARK  5.3.2.  (a)  Explicit  definitions  of  are  given  by 

Wichura  (1980);  for  example, 

(5.3.8a)  AQ  -  2g', 

(5.3.8b)  A  -  -8^-=-, 

(g  'Y 

(5.3.8c)  a,  -  -  iiSOi  +  JL^. 

L  (g  'Y  (g'r 

In  particular,  the  theorem  reduces  to  Theorem  3.1.1  in  the  case  n  *  0. 

(b)  Theorem  3.1.1  is  of  help  in  checking  (5.3.4).  Q 

As  Theorem  3.2.1  follows  from  Theorem  3.1.1,  so  Theorem  5.3.1 
yields  the  invariance  principle 


92 


THEOREM  5*3.3.  Lee  S  satisfy  the  assumptions  of  Section  1.1. 
Suppose  the  positive  function  g  6  U  satisfies  the  growth 
conditions 

(5.3.9)  t 

and 

(5.3.10)  +  0 
tl  "  M 

where 

(5.3.11)  ~  L  T\  <  i. 


Let  n  1  0,  and  suppose  g  has  (n  +  1)  continuous  derivatives 
satisfying  the  smoothness  condition  (5.3.2)  and  the  growth  condition 
(5.3.3).  Then  with  A^  and  J^’)  as  *n  Theorem  5.3.1  and  with 


Pm  given  as  usual  by  (5.1.3), 


n  r®  i  -  y<o 

(5.3.12)  P„  -  lU  V->  +  0(J  m  £  . 


v^2n  "  X(t) 


dt) 


as  m  oo  ,  provided  (5.2.10)  holds.  Q 

REMARK  5.3.4,  (a)  The  derivatives  of  g  can  be  computed  ignoring 
9,  in  the  sense  of  (5.3.3),  for  all  boundaries  g  of  practical 
Interest  (e.g.,  for  g  as  in  (1.3.9)  or  Example  1.6.3).  Condition 
(5.3.2)  is  likewise  a  mild  restriction. 
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(b)  As  always,  Che  regularity  conditions  for  g  need  only  be 
satisfied  for  large  t. 

(c)  The  smoothness  condition  (5.3.2)  can  be  weakened  to 

(5.3.13)  g(k)(u)  -  g(k)(t)  when  t,  u  oo  as  in  (2.1.5) 

provided  (3. 2. 4-6)  hold. 

(d)  Theorem  5.3.3  is  clearly  the  strong-law  counterpart  to  the 
result  (5.2.9)  of  the  previous  section.  With  *  1/3  the  bound  on 

n  in  (5.2.10)  is  0,  and  Theorem  5.3.3  reduces  to  Theorem  3.2.1.  In  a 
case  such  as  (5.2.11),  for  each  T|  <  1/2  the  boundary  g  satisfies 
(5.3.10)  for  all  large  t,  and  so  the  expansion  (5.3.12)  is  complete. 


(e)  Suppose,  in  addition  to  the  assumptions  of  Theorem  5.3.3,  that 
the  boundary  Is  also  in  the  range  of  moderate  growth,  i.e.,  that  (2.1.3) 
holds  for  some  0<S<l/6.  In  this  case  one  can  show  without 
difficulty  that 


(5.3.14) 


Tco  i  "  0*  (t)  1 

JmXe  ^j2k-l(t) 


Pm 

*2k(m) 


P{Sm  >  g(m) } 
¥2k(m) 


k  1  0. 


Provided  that  (5.3.5)  is  also  a  lower  bound,  i.e.,  that 

(5.3.15)  \(t)  =  — a-  1  - -  as  t  -»  oo  , 

7B2k  L(t) 

the  k  th  term  J^m)  in  (5.3.12)  is  also  of  order  of  magnitude 
(5.3.14).  For  k  ■  1  one  can  even  show  that 
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(5.3.16) 


Jl(o)  "  iVm 


Pm 

*2(m) 


where  (c£. 


(5.3.17) 


(5.3.8b)) 
A1  (m) 

\  "  g'(m) 


g"(m)  s  1 
(g'(m))3  *2(m)‘ 


In  particular,  if  Tj  i  2/5  in  (5.3.10),  i.e.,  if 
g(t)/t3^5  I  0,  then 

(5.3.18)  P«  [1  +  t  A  +  o(-si— )]J0(m), 
m  2  ^  ?2(m)  0 


demonstrating  that  the  relative  error  in  the  approximation 

J  S  Jn(m)  of  Theorem  3.2.1  is  in  this  case  of  the  same  order  of 
m  u 

2 

magnitude  as  1/f  (m).  Q 

To  prove  Theorem  5.3.3  one  uses  Theorem  5.3.1  and  the  invariance 
principle  of  Lemma  3.2.5  with 

(5.3.19)  a  -  2  -  f- 

The  particulars  of  the  proof  require  detailed  knowledge  of  the 
functions  A^  of  (5.3.5)  and  are  omitted. 

5.4.  An  asymptotic  upper  bound  on  the  relative  error  in  (2.1.6). 


The  following  theorem  is  a  refinement  of  Theorem  2.1.1. 
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THEOREM  5.4.1.  Suppose,  in  addition  to  the  hypotheses  of  Theorem 
2.1.1,  that  g  is  twice  differentiable  with 

(5.4.1)  g”(t)  -  0(^-)  as  t  co. 

t 

Then  we  have  the  refinement 


(5.4.2) 


‘ 11  +  0(i2fw 1  + 


of  (2.1.6).  0 


REMARK  5.4.2.  (a)  The  growth  condition  (5.4.1)  is  satisfied  in 
particular  if  the  derivatives  of  g  can  be  computed  ignoring  ?  in 
the  sense  of  (5.3.3). 


(b)  The  two-term  big-oh  bound  of  (5.4.2)  on  the  relative  error  in 

the  approximation  I  for  p_  reduces  to  the  first  term  when 

m 

2/3 

g(m)  -  0(m  )  and  to  the  second  in  the  contrasting  case 

m2^3  -  0(g(m) ) . 

(c)  For  comparison,  recall  that  the  relative  errors  of  the 

approximations  (1.2.7)  and  (1.2.8)  are  0(l>(m)/m1^2)  and 
2 

0(1 /t  (m)),  respectively.  For  (1.2.7)  we  are  ignoring  the  good 
fortune  of  (5.2.6b). 

3/5 

(d)  In  case  g(t)/t  0,  the  approximation  J  of 

n 

2 

Theorem  3.2.1  has  greater  accuracy  (relative  error  =  1/f  (m)  ) 
than  that  guaranteed  by  (5.4.2)  (relative  error 
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-  O((log  $2(m))/?2(m))  )  for  1^.  See  Remark 
5.3.4(e).  0 

PROOF  of  Theorem  5.4.1.  As  a  consequence  of  Theorem  2.1.1,  whose 
proof  we  shall  review  in  order  to  sketch  that  of  (5.4.2), 

(5.4.3)  =  [^(m)]  1exp[-  -]jj?2(m)]  as  m  as 

with  V  defined  by  (2.3.7).  It  follows  as  In  Section  2.3  that  If 
the  definition  (2.3.2)  Is  replaced  by 

(5.4.4)  v  ■  [Vl  *  pr  ^^)~| 

”  b  »2(«> 

with  0  <  8*  <  6,  then  (2. 3. 3-4)  continue  to  hold,  and 

(5.4.5)  p  -  0(— ~ — )  •  p  as  m  ->  oo  . 

vm  *2(m)  m 

Assuming  (5.4.1),  the  definition  of  6  (see  Section  2.4)  and  Taylor's 

m 

theorem  imply  the  improvement 

2 

(5.4.6)  6  -  (1  +  0(^-^„-ty  ^-) )  •  g'(m)  as  m  ao 

“  f2(m) 

to  (2.4.1). 

Now  the  proof  of  Lemma  2.4.1  shows  that  the  integral  in  (2.5.27)  is 
smaller  than  1  by  an  amount 

0(C  l/2)  -  0((?(m)/m1/2)1/2).  But  Lorden's  (1970)  method 
m 

can  be  used  to  show  that  for  any  J  >  0 
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(5.4.7) 


exP<5Em>ei(a)Ry^  *  ^.e^m) 


o.e^m) 


expCJRy) 

[(JX+  +  l)exp(?X+)]  -  1 

5Em,©1(m)X 


With  5  chosen  within  the  interval  of  finiteness  of  the  cgf  K, 
this  leads  to  the  improved  estimate 

*'a  f(m)/J? 

for  the  final  integral  in  (2.5.27).  The  relation  (2.5.30)  can  be 

developed  into  a  complete  asymptotic  expansion;  in  particular,  one  can 

show  that  the  relative  error  in  (2.5.30)  is 
2  1/2 

0(1/1  (m))  +  0(f(m)/m  ).  The  same  two-term  bound  on  the 

emr  holds  for  (2.4.9)  and  hence  for  (2.4.12).  As  (2.3.1)  can  be 
tightened  to  (5.4.5),  so  (2.4.13)  can  be  improved  to 

(5.4.8)  p  (jf  )  -  0(— sp — )  *  p  as  m  oo  . 

Vm  -  f2(m)  - 


Combining  the  various  bounds  we  arrive  at  (5.4.2).  Q 

REMARK  5.4.3.  To  get  fur  her  terms  in  an  asymptotic  expansion  for 

p  in  the  case  of  Theorem  2.1.1,  one  would  need  to  approximate  g 

m 

more  closely  than  by  the  straight  lines  l_  and  ?  of  Section  2.4, 

-HD  in 

perhaps  by  higher-order  polynomials.  One  would  then  seek  appropriate 
results  from  nonlinear  renewal  theory  to  replace  the  use  of  Lorden's 
(1970)  bound,  fl 
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